Assessing the behavior and performance of a supervised term-weighting technique for topic-based retrieval

07/13/2020
by   Mariano Maisonnave, et al.

This article analyses and evaluates FDDβ, a supervised term-weighting scheme that can be applied to query-term selection in topic-based retrieval. FDDβ weights terms based on two factors representing the descriptive and discriminating power of the terms with respect to the given topic. It then combines these two factors through an adjustable parameter that makes it possible to favor different aspects of retrieval, such as precision, recall, or a balance between the two. The article makes the following contributions: (1) it presents an extensive analysis of the behavior of FDDβ as a function of its adjustable parameter; (2) it compares FDDβ against eighteen traditional and state-of-the-art weighting schemes; (3) it evaluates the performance of disjunctive queries built by combining terms selected using the analyzed methods; (4) it introduces a new public data set with news labeled as relevant or irrelevant to the economic domain. The analysis and evaluations are performed on three data sets: two well-known text data sets, namely 20 Newsgroups and Reuters-21578, and the newly released data set. The results indicate that, despite its simplicity, FDDβ is competitive with state-of-the-art methods and has the important advantage of being flexible enough to adapt to specific task goals. The results also demonstrate that FDDβ offers a useful mechanism for exploring different approaches to building complex queries.
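The abstract does not give the formula, so the following is only a minimal sketch of how a two-factor weight with an adjustable parameter could work. It assumes the descriptive factor is the share of on-topic documents containing the term, the discriminating factor is the share of documents containing the term that are on-topic, and the two are combined in the style of an F-beta measure. All function names, the toy documents, and the combination rule are assumptions made for illustration, not the authors' definitions.

```python
from typing import List, Set


def descriptive_power(term: str, relevant: List[Set[str]]) -> float:
    """Assumed definition: fraction of on-topic documents containing the term."""
    if not relevant:
        return 0.0
    return sum(term in doc for doc in relevant) / len(relevant)


def discriminating_power(term: str, relevant: List[Set[str]],
                         irrelevant: List[Set[str]]) -> float:
    """Assumed definition: fraction of documents containing the term that are on-topic."""
    in_relevant = sum(term in doc for doc in relevant)
    in_irrelevant = sum(term in doc for doc in irrelevant)
    total = in_relevant + in_irrelevant
    return in_relevant / total if total else 0.0


def combined_weight(descr: float, discr: float, beta: float) -> float:
    """Assumed F-beta-style combination: beta > 1 leans toward the descriptive
    (recall-like) factor, beta < 1 toward the discriminating (precision-like) one."""
    denom = beta ** 2 * discr + descr
    if denom == 0:
        return 0.0
    return (1 + beta ** 2) * descr * discr / denom


# Toy documents represented as sets of terms (hypothetical data).
relevant_docs = [{"bank", "rate"}, {"bank", "loan"}, {"market"}, {"bank"}]
irrelevant_docs = [{"bank", "river"}, {"game"}, {"rate", "game"}]


def weight(term: str, beta: float) -> float:
    """Weight of a term for the toy topic under the assumptions above."""
    descr = descriptive_power(term, relevant_docs)
    discr = discriminating_power(term, relevant_docs, irrelevant_docs)
    return combined_weight(descr, discr, beta)
```

Under these assumptions, a rare but topic-exclusive term such as "loan" scores higher with a small beta than with a large one, which mirrors the precision-versus-recall trade-off the abstract attributes to the adjustable parameter.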


