Automated Query Learning with Wikipedia and Genetic Programming

12/03/2010
by Pekka Malo, et al.

Most existing information retrieval systems are based on the bag-of-words model and are not equipped with common world knowledge. Work has been done on improving the efficiency of such systems by using intelligent algorithms to generate search queries; however, little research has addressed incorporating human- and society-level knowledge into the queries. This paper is one of the first attempts to incorporate such information into search queries using Wikipedia semantics. It presents a shift from conventional token-based queries to concept-based queries, which improves the efficiency of information retrieval systems. To handle the automated query learning problem efficiently, we propose the Wikipedia-based Evolutionary Semantics (Wiki-ES) framework, in which concept-based queries are learnt using a co-evolving evolutionary procedure. An extensive study on Reuters newswire documents shows that learning concept-based queries with an intelligent evolutionary procedure yields a significant improvement in performance. The proposed framework is compared with other information retrieval systems. The concept-based approach has also been implemented on other information retrieval systems to justify the effectiveness of the transition from token-based to concept-based queries.
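
To make the idea of a concept-based query concrete, the following is a minimal sketch and not the Wiki-ES implementation. It assumes that each document has already been annotated with a set of Wikipedia concept names, that a query is a Boolean expression tree over those concepts, and that an evolutionary learner would score candidate queries by F1 on labelled documents. The tree encoding, the names `matches` and `f1_fitness`, the choice of F1, and the toy data are all illustrative assumptions; the abstract does not specify any of them.

```python
from typing import Union

# Hypothetical encoding: a query is a concept name (str) or a tuple
# of the form (operator, operand, ...), with operator in AND / OR / NOT.
Query = Union[str, tuple]


def matches(query: Query, concepts: set) -> bool:
    """Evaluate a Boolean concept query against one document's concept set."""
    if isinstance(query, str):
        return query in concepts
    op, *args = query
    if op == "AND":
        return all(matches(a, concepts) for a in args)
    if op == "OR":
        return any(matches(a, concepts) for a in args)
    if op == "NOT":
        return not matches(args[0], concepts)
    raise ValueError(f"unknown operator: {op}")


def f1_fitness(query: Query, docs: list) -> float:
    """F1 score of a query over (concept_set, is_relevant) pairs."""
    tp = fp = fn = 0
    for concepts, relevant in docs:
        hit = matches(query, concepts)
        if hit and relevant:
            tp += 1
        elif hit and not relevant:
            fp += 1
        elif not hit and relevant:
            fn += 1
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labelled collection (invented for illustration only).
docs = [
    ({"Central bank", "Interest rate"}, True),
    ({"Interest rate", "Mortgage loan"}, True),
    ({"Central bank", "Football"}, False),
    ({"Football"}, False),
]

query = ("AND", "Interest rate", ("NOT", "Football"))
print(f1_fitness(query, docs))
```

In a genetic programming setting, a fitness function of this kind would be applied to each candidate tree in the population, and crossover and mutation would act on the subtrees. A token-based query would use the same evaluation with word tokens in place of concept names.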

