A PSO Strategy of Finding Relevant Web Documents using a New Similarity Measure

03/26/2021
by Dr. Ramya C, et al.

The Internet and the World Wide Web offer a tremendous amount of information, so increasing emphasis is being placed on search services and functionality. Currently, the majority of web portals offer their own search utilities, of varying quality. These can search the content within the sites, mainly the textual content of documents. In this paper, a novel similarity measure called SMDR (Similarity Measure for Documents Retrieval) is proposed to help retrieve more similar documents from the repository, thus contributing considerably to the effectiveness of the Web Information Retrieval (WIR) process. A bio-inspired Particle Swarm Optimization (PSO) methodology is used to reduce the response time of the system and to optimize the WIR process, thereby contributing to the efficiency of the system. The paper also presents a comparative study of the proposed system against the existing method in terms of accuracy, sensitivity, F-measure and specificity. Extensive experiments are conducted on the CACM collection, and better precision-recall rates are achieved than with the existing system. The experimental results demonstrate the effectiveness and efficiency of the proposed system.
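The evaluation measures named in the abstract (accuracy, sensitivity, specificity, F-measure, precision and recall) have standard definitions over a confusion matrix of retrieved versus relevant documents. The following is a minimal sketch of those textbook definitions, not the authors' evaluation code; the names `Confusion`, `confusion_from_sets` and `retrieval_metrics` are hypothetical, and the document IDs in the usage note are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Confusion:
    tp: int  # relevant documents that were retrieved
    fp: int  # non-relevant documents that were retrieved
    fn: int  # relevant documents that were missed
    tn: int  # non-relevant documents that were correctly left out


def _ratio(numerator: float, denominator: float) -> float:
    """Divide safely, returning 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def confusion_from_sets(retrieved: set, relevant: set, collection_size: int) -> Confusion:
    """Build the confusion matrix for one query from sets of document IDs."""
    tp = len(retrieved & relevant)
    fp = len(retrieved - relevant)
    fn = len(relevant - retrieved)
    tn = collection_size - tp - fp - fn
    return Confusion(tp, fp, fn, tn)


def retrieval_metrics(c: Confusion) -> dict:
    """Standard definitions; sensitivity is the same quantity as recall."""
    precision = _ratio(c.tp, c.tp + c.fp)
    sensitivity = _ratio(c.tp, c.tp + c.fn)
    specificity = _ratio(c.tn, c.tn + c.fp)
    accuracy = _ratio(c.tp + c.tn, c.tp + c.fp + c.fn + c.tn)
    f_measure = _ratio(2 * precision * sensitivity, precision + sensitivity)
    return {
        "precision": precision,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "f_measure": f_measure,
    }
```

For example, `retrieval_metrics(confusion_from_sets({1, 2, 3, 4}, {2, 3, 5}, 10))` scores a query that retrieved four documents out of a ten-document collection in which three are relevant.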


