Building Large Machine Reading-Comprehension Datasets using Paragraph Vectors

12/13/2016
by   Radu Soricut, et al.
0

We present a dual contribution to the task of machine reading-comprehension: a technique for creating large-sized machine-comprehension (MC) datasets using paragraph-vector models; and a novel, hybrid neural-network architecture that combines the representation power of recurrent neural networks with the discriminative power of fully-connected multi-layered networks. We use the MC-dataset generation technique to build a dataset of around 2 million examples, for which we empirically determine the high-ceiling of human performance (around 91 computer models. Among all the models we have experimented with, our hybrid neural-network architecture achieves the highest performance (83.2 The remaining gap to the human-performance ceiling provides enough room for future model improvements.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2016

Consensus Attention-based Neural Networks for Chinese Reading Comprehension

Reading comprehension has embraced a booming in recent NLP research. Sev...
research
03/28/2019

Sogou Machine Reading Comprehension Toolkit

Machine reading comprehension have been intensively studied in recent ye...
research
08/27/2018

Comparing Attention-based Convolutional and Recurrent Neural Networks: Success and Limitations in Machine Reading Comprehension

We propose a machine reading comprehension model based on the compare-ag...
research
03/15/2018

HFL-RC System at SemEval-2018 Task 11: Hybrid Multi-Aspects Model for Commonsense Reading Comprehension

This paper describes the system which got the state-of-the-art results a...
research
06/29/2017

Two-Stage Synthesis Networks for Transfer Learning in Machine Comprehension

We develop a technique for transfer learning in machine comprehension (M...
research
11/23/2016

Emergent Predication Structure in Hidden State Vectors of Neural Readers

A significant number of neural architectures for reading comprehension h...
research
03/26/2018

CliCR: A Dataset of Clinical Case Reports for Machine Reading Comprehension

We present a new dataset for machine comprehension in the medical domain...

Please sign up or login with your details

Forgot password? Click here to reset