A novel model for query expansion using pseudo-relevant web knowledge

08/27/2019
by Hiteshwar Kumar Azad, et al.

In information retrieval, query expansion (QE) has long been used to deal with the fundamental problem of word mismatch between a user's query and the target information. Existing weighting techniques often fail to capture both the term-term relationship among expansion terms and the relationship between each term and the query as a whole, resulting in low retrieval effectiveness. Our QE approach addresses this with three weighting models based on (1) tf-itf, (2) k-nearest neighbor (kNN) based cosine similarity, and (3) correlation score. To extract the initial set of expansion terms, we use pseudo-relevant web knowledge consisting of the top N web pages returned by three popular search engines, namely Google, Bing, and DuckDuckGo, in response to the original query. Of the three weighting models, tf-itf scores each individual term obtained from the web content, kNN-based cosine similarity scores the expansion terms to capture the term-term relationship, and the correlation score weighs the selected expansion terms with respect to the whole query. The proposed model, called web knowledge based query expansion (WKQE), achieves an improvement of 25.89 over the unexpanded queries on the FIRE dataset. A comparative analysis of WKQE against other related approaches shows a significant improvement in retrieval performance. We also analyze the effect of varying the number of pseudo-relevant documents and expansion terms on the retrieval effectiveness of the proposed model.
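To make the three-stage idea concrete, here is a minimal Python sketch of a pseudo-relevance expansion pipeline. It is an illustration under stated assumptions, not the WKQE method itself: the abstract does not give the tf-itf, kNN, or correlation formulas, so the sketch substitutes a generic tf-idf-style weight for the per-term score and a mean cosine similarity over per-document count vectors for the term-to-query score. All function names (`tokenize`, `term_weights`, `doc_vector`, `cosine`, `expand`) and the toy documents are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer that keeps alphabetic tokens only."""
    return [t for t in text.lower().split() if t.isalpha()]


def term_weights(docs):
    """Per-term score over the pseudo-relevant documents.

    ASSUMPTION: a tf-idf-style stand-in, not the paper's tf-itf formula.
    """
    tokenized = [tokenize(d) for d in docs]
    tf = Counter(t for doc in tokenized for t in doc)
    df = Counter(t for doc in tokenized for t in set(doc))
    n = len(docs)
    return {t: tf[t] * math.log(1 + n / df[t]) for t in tf}


def doc_vector(term, docs):
    """Represent a term by its count in each pseudo-relevant document."""
    return [tokenize(d).count(term) for d in docs]


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def expand(query, docs, k=3):
    """Return up to k expansion terms ranked by weight * similarity to the query.

    ASSUMPTION: the query-level score is the mean cosine similarity between
    a candidate term and each query term; the paper's correlation score may differ.
    """
    q_terms = tokenize(query)
    if not q_terms:
        return []
    weights = term_weights(docs)
    q_vectors = [doc_vector(q, docs) for q in q_terms]
    scores = {}
    for term, weight in weights.items():
        if term in q_terms:
            continue
        t_vec = doc_vector(term, docs)
        sim = sum(cosine(t_vec, qv) for qv in q_vectors) / len(q_vectors)
        scores[term] = weight * sim
    # Sort by descending score, then alphabetically for a deterministic order.
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


# Toy stand-in for the top N web pages returned for the query.
docs = [
    "query expansion improves retrieval",
    "query expansion adds terms",
    "cats sleep all day",
]
expansion_terms = expand("query expansion", docs, k=3)
```

In this toy run, terms that occur only in the unrelated third document receive a similarity of zero and are ranked below terms that co-occur with the query words. A real system would replace the toy documents with fetched web pages and use the weighting formulas from the full text.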


research
01/29/2019

A New Approach for Query Expansion using Wikipedia and WordNet

Query expansion (QE) is a well known technique to enhance the effectiven...
research
02/27/2019

Query Term Weighting based on Query Performance Prediction

This work presents a general query term weighting approach based on quer...
research
02/22/2023

Effectiveness and Efficiency Trade-off in Selective Query Processing

Query processing in search engines can be optimized for use for all quer...
research
07/25/2017

Learning Word Relatedness over Time

Search systems are often focused on providing relevant results for the "...
research
08/28/2018

Automated Query Expansion using High Dimensional Clustering

The exponential growth of information on the Internet has created a big ...
research
02/22/2022

Query Expansion and Entity Weighting for Query Reformulation Retrieval in Voice Assistant Systems

Voice assistants such as Alexa, Siri, and Google Assistant have become i...
