Training Curricula for Open Domain Answer Re-Ranking

04/29/2020
by   Sean MacAvaney, et al.
0

In precision-oriented tasks like answer ranking, it is more important to rank many relevant answers highly than to retrieve all relevant answers. It follows that a good ranking strategy would be to learn how to identify the easiest correct answers first (i.e., assign a high ranking score to answers that have characteristics that usually indicate relevance, and a low ranking score to those with characteristics that do not), before incorporating more complex logic to handle difficult cases (e.g., semantic matching or reasoning). In this work, we apply this idea to the training of neural answer rankers using curriculum learning. We propose several heuristics to estimate the difficulty of a given training sample. We show that the proposed heuristics can be used to build a training curriculum that down-weights difficult samples early in the training process. As the training process begins, our approach gradually shifts to weighting all samples equally, regardless of difficulty. We present a comprehensive evaluation of our proposed idea on three answer ranking datasets. Results show that our approach leads to superior performance of two leading neural ranking architectures, namely BERT and ConvKNRM, using both pointwise and pairwise losses. When applied to a BERT-based ranker, our method yields up to a 4 trained without a curriculum. This results in models that can achieve comparable performance to more expensive state-of-the-art techniques.

READ FULL TEXT
research
10/20/2019

Image Difficulty Curriculum for Generative Adversarial Networks (CuGAN)

Despite the significant advances in recent years, Generative Adversarial...
research
02/02/2023

Human not in the loop: objective sample difficulty measures for Curriculum Learning

Curriculum learning is a learning method that trains models in a meaning...
research
02/01/2021

Hierarchical Ranking for Answer Selection

Answer selection is a task to choose the positive answers from a pool of...
research
09/04/2018

Improved Online Wilson Score Interval Method for Community Answer Quality Ranking

In this paper, a fast and easy-to-deploy method with a strong interpreta...
research
04/01/2020

Recommandation ontologique multicritère pour la métrologie

Matchmaking and information ranking are helping process for users, by of...
research
08/24/2021

Density-Based Dynamic Curriculum Learning for Intent Detection

Pre-trained language models have achieved noticeable performance on the ...
research
05/30/2019

Evaluating Artificial Systems for Pairwise Ranking Tasks Sensitive to Individual Differences

Owing to the advancement of deep learning, artificial systems are now ri...

Please sign up or login with your details

Forgot password? Click here to reset