ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion

06/22/2020
by Bingning Wang, et al.

This paper presents ReCO, a human-curated Chinese Reading Comprehension dataset on Opinion. The questions in ReCO are opinion-based queries issued to a commercial search engine. The passages are provided by crowdworkers, who extract the supporting snippets from the retrieved documents. Finally, the crowdworkers give an abstractive yes/no/uncertain answer. The release of ReCO consists of 300k questions, which to our knowledge makes it the largest Chinese reading comprehension dataset. A prominent characteristic of ReCO is that, in addition to the original context paragraph, we also provide the support evidence that can be directly used to answer the question. Quality analysis demonstrates the challenge of ReCO, which requires various types of reasoning skills, such as causal inference and logical reasoning. Current QA models that perform very well on many question answering problems, such as BERT, achieve only 77% accuracy, indicating that ReCO presents a good challenge for machine reading comprehension. The code and datasets are freely available at https://github.com/benywon/ReCO.
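
To make the structure described in the abstract concrete, here is a minimal Python sketch of what one example and an accuracy computation might look like. The class name `OpinionExample`, its field names, the `accuracy` helper, and the sample text are all hypothetical and chosen only for illustration; they are not taken from the released files, whose actual schema should be checked in the repository.

```python
from dataclasses import dataclass
from typing import List

# The three answer labels named in the abstract.
LABELS = ("yes", "no", "uncertain")


@dataclass
class OpinionExample:
    """One hypothetical record; field names are assumptions, not the released schema."""
    question: str   # opinion-based query issued to a search engine
    context: str    # original context paragraph from a retrieved document
    evidence: str   # support snippet extracted by a crowdworker
    answer: str     # abstractive label, one of LABELS

    def __post_init__(self) -> None:
        if self.answer not in LABELS:
            raise ValueError(f"answer must be one of {LABELS}, got {self.answer!r}")


def accuracy(gold: List[OpinionExample], predicted: List[str]) -> float:
    """Fraction of examples whose predicted label equals the gold label."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        return 0.0
    correct = sum(ex.answer == pred for ex, pred in zip(gold, predicted))
    return correct / len(gold)


# Made-up sample data, for illustration only.
examples = [
    OpinionExample(
        question="Is green tea good for sleep?",
        context="Green tea contains caffeine. Many people avoid it at night.",
        evidence="Green tea contains caffeine.",
        answer="no",
    ),
    OpinionExample(
        question="Can cats eat cooked rice?",
        context="Opinions differ on feeding rice to cats.",
        evidence="Opinions differ on feeding rice to cats.",
        answer="uncertain",
    ),
]
predictions = ["no", "yes"]
score = accuracy(examples, predictions)
```

Under these assumptions a model is treated as a three-way classifier over (question, evidence) or (question, context) pairs, and a score such as the one reported for BERT would be the share of exact label matches.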

