Learning to Search in Long Documents Using Document Structure

06/09/2018
by   Mor Geva, et al.
0

Reading comprehension models are based on recurrent neural networks that sequentially process the document tokens. As interest turns to answering more complex questions over longer documents, sequential reading of large portions of text becomes a substantial bottleneck. Inspired by how humans use document structure, we propose a novel framework for reading comprehension. We represent documents as trees, and model an agent that learns to interleave quick navigation through the document tree with more expensive answer extraction. To encourage exploration of the document tree, we propose a new algorithm, based on Deep Q-Network (DQN), which strategically samples tree nodes at training time. Empirically we find our algorithm improves question answering performance compared to DQN and a strong information-retrieval (IR) baseline, and that ensembling our model with the IR baseline results in further gains in performance.

READ FULL TEXT

page 8

page 15

page 16

research
12/19/2019

CJRC: A Reliable Human-Annotated Benchmark DataSet for Chinese Judicial Reading Comprehension

We present a Chinese judicial reading comprehension (CJRC) dataset which...
research
11/06/2016

Hierarchical Question Answering for Long Documents

We present a framework for question answering that can efficiently scale...
research
05/23/2019

Multi-hop Reading Comprehension via Deep Reinforcement Learning based Document Traversal

Reading Comprehension has received significant attention in recent years...
research
07/01/2020

DocVQA: A Dataset for VQA on Document Images

We present a new dataset for Visual Question Answering on document image...
research
05/01/2023

CHIC: Corporate Document for Visual question Answering

The massive use of digital documents due to the substantial trend of pap...
research
07/07/2011

Text Classification: A Sequential Reading Approach

We propose to model the text classification process as a sequential deci...
research
11/12/2017

Fast Reading Comprehension with ConvNets

State-of-the-art deep reading comprehension models are dominated by recu...

Please sign up or login with your details

Forgot password? Click here to reset