A New Approach for Query Expansion using Wikipedia and WordNet

01/29/2019
by Hiteshwar Kumar Azad, et al.

Query expansion (QE) is a well-known technique for enhancing the effectiveness of information retrieval (IR). QE reformulates the initial query by adding similar terms that help retrieve more relevant results. Several approaches have been proposed with remarkable outcomes, but they are not equally favorable for all types of queries. One of the main reasons for this is the use of the same data source for expanding both the individual and the phrase query terms. As a result, the holistic relationship among the query terms is not well captured. To address this issue, we select separate data sources for individual and phrase terms. Specifically, we use WordNet for expanding individual terms and Wikipedia for expanding phrase terms. We also propose novel schemes for weighting expanded terms: an in-link score (for terms extracted from Wikipedia) and a tf-idf-based scheme (for terms extracted from WordNet). In the proposed Wikipedia-WordNet-based QE technique (WWQE), we weight the expansion terms twice: first, each term is scored individually by its weighting scheme, and then the selected expansion terms are re-scored in relation to the entire query using a correlation score. The experimental results show that the proposed approach successfully combines Wikipedia and WordNet, as demonstrated by better performance on standard evaluation metrics on the FIRE dataset. The proposed WWQE approach can also be used with other standard weighting models to improve the effectiveness of IR.
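
The two-stage weighting described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the tf-idf variant, the in-link normalisation, and the Jaccard-style correlation used here are assumptions, and the dictionaries `SYNONYMS`, `PHRASE_INLINKS`, and `DOCS` are hypothetical toy data standing in for WordNet, Wikipedia, and a document collection.

```python
from math import log

# Hypothetical toy data (assumptions, not real WordNet/Wikipedia content).
SYNONYMS = {  # stands in for WordNet: individual term -> candidate terms
    "car": ["automobile", "vehicle"],
    "insurance": ["coverage", "policy"],
}
PHRASE_INLINKS = {  # stands in for Wikipedia: phrase -> {candidate: in-link count}
    "car insurance": {"premium": 40, "liability": 25, "vehicle": 10},
}
DOCS = [
    "car insurance premium policy",
    "vehicle liability coverage policy",
    "automobile vehicle car",
    "insurance premium liability",
]
DOC_TOKENS = [d.split() for d in DOCS]


def doc_ids(term):
    """Indices of the toy documents that contain `term`."""
    return {i for i, toks in enumerate(DOC_TOKENS) if term in toks}


def normalise(scores):
    """Scale scores into [0, 1] so the two sources are comparable."""
    top = max(scores.values(), default=0.0)
    return {t: (s / top if top else 0.0) for t, s in scores.items()}


def individual_scores(words, phrases):
    """First pass: score each candidate with its own source's scheme."""
    n = len(DOCS)
    tfidf = {}
    for w in words:  # individual terms -> synonym source, tf-idf-style score
        for cand in SYNONYMS.get(w, []):
            df = len(doc_ids(cand))
            tf = sum(toks.count(cand) for toks in DOC_TOKENS)
            tfidf[cand] = tf * log(1 + n / df) if df else 0.0
    inlink = {}
    for p in phrases:  # phrase terms -> link source, in-link count as score
        for cand, links in PHRASE_INLINKS[p].items():
            inlink[cand] = max(inlink.get(cand, 0), links)
    merged = normalise(tfidf)
    for cand, s in normalise(inlink).items():
        merged[cand] = max(merged.get(cand, 0.0), s)
    return merged


def correlation(cand, words):
    """Second pass: mean Jaccard co-occurrence of `cand` with every query term."""
    c = doc_ids(cand)
    total = 0.0
    for w in words:
        q = doc_ids(w)
        union = c | q
        total += len(c & q) / len(union) if union else 0.0
    return total / len(words) if words else 0.0


def expand(query, k=3):
    """Return the top-k (term, score) expansion candidates for `query`."""
    words = query.lower().split()
    phrases = [" ".join(words[i:i + 2]) for i in range(len(words) - 1)]
    phrases = [p for p in phrases if p in PHRASE_INLINKS]
    first = individual_scores(words, phrases)
    final = {t: s * correlation(t, words)
             for t, s in first.items() if t not in words}
    return sorted(final.items(), key=lambda kv: kv[1], reverse=True)[:k]


if __name__ == "__main__":
    for term, score in expand("car insurance"):
        print(f"{term}\t{score:.3f}")
```

Multiplying the per-source score by a whole-query correlation is one simple way to penalise candidates that relate to only a single query term. The paper itself may combine the two scores differently.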

research
08/27/2019

A novel model for query expansion using pseudo-relevant web knowledge

In the field of information retrieval, query expansion (QE) has long bee...
research
01/30/2013

Query Expansion in Information Retrieval Systems using a Bayesian Network-Based Thesaurus

Information Retrieval (IR) is concerned with the identification of docum...
research
11/25/2017

Acronym Disambiguation: A Domain Independent Approach

Acronyms are omnipresent. They usually express information that is repet...
research
02/22/2022

Query Expansion and Entity Weighting for Query Reformulation Retrieval in Voice Assistant Systems

Voice assistants such as Alexa, Siri, and Google Assistant have become i...
research
11/23/2017

Wiki-MetaSemantik: A Wikipedia-derived Query Expansion Approach based on Network Properties

This paper discusses the use of Wikipedia for building semantic ontologi...
research
07/13/2020

Assessing the behavior and performance of a supervised term-weighting technique for topic-based retrieval

This article analyses and evaluates FDDβ, a supervised term-weightin...
research
02/22/2023

Effectiveness and Efficiency Trade-off in Selective Query Processing

Query processing in search engines can be optimized for use for all quer...