Exploiting Hierarchical Dependence Structures for Unsupervised Rank Fusion in Information Retrieval

08/10/2022
by   J. Hermosillo-Valadez, et al.
0

The goal of rank fusion in information retrieval (IR) is to deliver a single output list from multiple search results. Improving performance by combining the outputs of various IR systems is a challenging task. A central point is the fact that many non-obvious factors are involved in the estimation of relevance, inducing nonlinear interrelations between the data. The ability to model complex dependency relationships between random variables has become increasingly popular in the realm of information retrieval, and the need to further explore these dependencies for data fusion has been recently acknowledged. Copulas provide a framework to separate the dependence structure from the margins. Inspired by the theory of copulas, we propose a new unsupervised, dynamic, nonlinear, rank fusion method, based on a nested composition of non-algebraic function pairs. The dependence structure of the model is tailored by leveraging query-document correlations on a per-query basis. We experimented with three topic sets over CLEF corpora fusing 3 and 6 retrieval systems, comparing our method against the CombMNZ technique and other nonlinear unsupervised strategies. The experiments show that our fusion approach improves performance under explicit conditions, providing insight about the circumstances under which linear fusion techniques have comparable performance to nonlinear methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2019

Information Retrieval and Its Sister Disciplines

This article presents a summary graph to show the relationships between ...
research
01/26/2021

Regulatory Compliance through Doc2Doc Information Retrieval: A case study in EU/UK legislation where text similarity has limitations

Major scandals in corporate history have urged the need for regulatory c...
research
12/10/2020

An Integrated Search Framework for Leveraging the Knowledge-Based Web Ecosystem

The explosion of information constrains the judgement of search terms as...
research
05/24/2023

Fusion-in-T5: Unifying Document Ranking Signals for Improved Information Retrieval

Common IR pipelines are typically cascade systems that may involve multi...
research
06/15/2023

Prompt Performance Prediction for Generative IR

The ability to predict the performance of a query in Information Retriev...
research
01/28/2020

Selective Weak Supervision for Neural Information Retrieval

This paper democratizes neural information retrieval to scenarios where ...
research
08/21/2023

Evaluating Temporal Persistence Using Replicability Measures

In real-world Information Retrieval (IR) experiments, the Evaluation Env...

Please sign up or login with your details

Forgot password? Click here to reset