Towards Theoretical Understanding of Weak Supervision for Information Retrieval

06/13/2018
by   Hamed Zamani, et al.
0

Neural network approaches have recently shown to be effective in several information retrieval (IR) tasks. However, neural approaches often require large volumes of training data to perform effectively, which is not always available. To mitigate the shortage of labeled data, training neural IR models with weak supervision has been recently proposed and received considerable attention in the literature. In weak supervision, an existing model automatically generates labels for a large set of unlabeled data, and a machine learning model is further trained on the generated "weak" data. Surprisingly, it has been shown in prior art that the trained neural model can outperform the weak labeler by a significant margin. Although these obtained improvements have been intuitively justified in previous work, the literature still lacks theoretical justification for the observed empirical findings. In this position paper, we propose to theoretically study weak supervision, in particular for IR tasks, e.g., learning to rank. We briefly review a set of our recent theoretical findings that shed light on learning from weakly supervised data, and provide guidelines on how train learning to rank models with weak supervision.

READ FULL TEXT

page 1

page 2

research
04/18/2023

Generalized Weak Supervision for Neural Information Retrieval

Neural ranking models (NRMs) have demonstrated effective performance in ...
research
04/28/2017

Neural Ranking Models with Weak Supervision

Despite the impressive improvements achieved by unsupervised deep neural...
research
12/29/2020

Meta Adaptive Neural Ranking with Contrastive Synthetic Supervision

Neural Information Retrieval (Neu-IR) models have shown their effectiven...
research
11/30/2017

Learning to Learn from Weak Supervision by Full Supervision

In this paper, we propose a method for training neural networks when we ...
research
01/28/2020

Selective Weak Supervision for Neural Information Retrieval

This paper democratizes neural information retrieval to scenarios where ...
research
05/07/2018

Learning Matching Models with Weak Supervision for Response Selection in Retrieval-based Chatbots

We propose a method that can leverage unlabeled data to learn a matching...
research
11/13/2018

Embedding Electronic Health Records for Clinical Information Retrieval

Neural network representation learning frameworks have recently shown to...

Please sign up or login with your details

Forgot password? Click here to reset