Neural Ranking Models with Multiple Document Fields

11/25/2017
by Hamed Zamani, et al.

Deep neural networks have recently shown promise in the ad-hoc retrieval task. However, such models have often been based on one field of the document, for example considering document title only or document body only. Since in practice documents typically have multiple fields, and given that non-neural ranking models such as BM25F have been developed to take advantage of document structure, this paper investigates how neural models can deal with multiple document fields. We introduce a model that can consume short text fields such as document title and long text fields such as document body. It can also handle multi-instance fields with a variable number of instances, for example where each document has zero or more instances of incoming anchor text. Since fields vary in coverage and quality, we introduce a masking method to handle missing field instances, as well as a field-level dropout method to avoid relying too much on any one field. As in the studies of non-neural field weighting, we find it is better for the ranker to score the whole document jointly, rather than generate a per-field score and aggregate. We find that different document fields may match different aspects of the query and therefore benefit from comparing with separate representations of the query text. The combination of techniques introduced here leads to a neural ranker that can take advantage of full document structure, including multi-instance and missing-instance data of variable length. The techniques significantly enhance the performance of the ranker, and also outperform a learning-to-rank baseline with hand-crafted features.
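
The masking and field-level dropout ideas mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how instances are combined or how dropout is applied, so the function names (aggregate_instances, field_dropout), the choice of averaging over present instances, the drop_prob parameter, and the rule of always keeping at least one field are all assumptions made purely for illustration.

```python
import numpy as np


def aggregate_instances(instance_vecs, present_mask):
    """Average the instance vectors of one multi-instance field.

    Assumption: missing (padded) instances are masked out and the
    remaining ones are averaged; the paper may combine them differently.

    instance_vecs: (max_instances, dim) array; padded rows may hold anything.
    present_mask:  (max_instances,) array of 0/1 flags, 1 = real instance.
    """
    mask = present_mask.astype(float)[:, None]
    total = (instance_vecs * mask).sum(axis=0)
    count = mask.sum()
    # A field with no instances yields a zero vector instead of NaN.
    if count == 0:
        return np.zeros(instance_vecs.shape[1])
    return total / count


def field_dropout(field_vecs, drop_prob, rng):
    """Zero out whole field vectors at random during training.

    Assumption: each field is dropped independently with probability
    drop_prob, and one field is restored if all were dropped.

    field_vecs: (n_fields, dim) array, one row per document field.
    """
    n_fields = field_vecs.shape[0]
    keep = rng.random(n_fields) >= drop_prob
    if not keep.any():
        keep[rng.integers(n_fields)] = True
    return field_vecs * keep[:, None]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical anchor-text field: two real instances and one padded slot.
    anchors = np.array([[1.0, 3.0], [3.0, 5.0], [9.0, 9.0]])
    anchor_vec = aggregate_instances(anchors, np.array([1, 1, 0]))
    # Hypothetical title, body, and anchor representations of equal size.
    fields = np.stack([np.ones(2), 2 * np.ones(2), anchor_vec])
    print(field_dropout(fields, drop_prob=0.5, rng=rng))
```

In this sketch the padded third anchor row never influences the field vector, and dropout removes a field as a whole rather than individual units, which is one plausible way to discourage a ranker from leaning on a single dominant field.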


research
11/18/2020

Non-Linear Multiple Field Interactions Neural Document Ranking

Ranking tasks are usually based on the text of the main body of the page...
research
03/16/2021

A Neural Passage Model for Ad-hoc Document Retrieval

Traditional statistical retrieval models often treat each document as a ...
research
05/12/2020

Interpreting Neural Ranking Models using Grad-CAM

Recently, applying deep neural networks in IR has become an important an...
research
12/03/2019

TU Wien @ TREC Deep Learning '19 – Simple Contextualization for Re-ranking

The usage of neural network models puts multiple objectives in conflict ...
research
06/23/2021

Learnt Sparsity for Effective and Interpretable Document Ranking

Machine learning models for the ad-hoc retrieval of documents and passag...
research
07/16/2018

Repeatability Corner Cases in Document Ranking: The Impact of Score Ties

Document ranking experiments should be repeatable: running the same rank...
research
05/12/2021

Multi-Field Models in Neural Recipe Ranking – An Early Exploratory Study

Explicitly modelling field interactions and correlations in complex docu...
