LogiQA: A Challenge Dataset for Machine Reading Comprehension with Logical Reasoning

07/16/2020
by   Jian Liu, et al.
0

Machine reading is a fundamental task for testing the capability of natural language understanding, which is closely related to human cognition in many aspects. With the rising of deep learning techniques, algorithmic models rival human performances on simple QA, and thus increasingly challenging machine reading datasets have been proposed. Though various challenges such as evidence integration and commonsense knowledge have been integrated, one of the fundamental capabilities in human reading, namely logical reasoning, is not fully investigated. We build a comprehensive dataset, named LogiQA, which is sourced from expert-written questions for testing human Logical reasoning. It consists of 8,678 QA instances, covering multiple types of deductive reasoning. Results show that state-of-the-art neural models perform by far worse than human ceiling. Our dataset can also serve as a benchmark for reinvestigating logical AI under the deep learning NLP setting. The dataset is freely available at https://github.com/lgw863/LogiQA-dataset

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/22/2020

ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion

This paper presents the ReCO, a human-curated ChineseReading Comprehensi...
research
11/10/2020

Natural Language Inference in Context – Investigating Contextual Reasoning over Long Texts

Natural language inference (NLI) is a fundamental NLP task, investigatin...
research
12/31/2020

Coreference Reasoning in Machine Reading Comprehension

The ability to reason about multiple references to a given entity is ess...
research
05/21/2021

Fact-driven Logical Reasoning

Logical reasoning, which is closely related to human cognition, is of vi...
research
04/27/2020

PuzzLing Machines: A Challenge on Learning From Small Data

Deep neural models have repeatedly proved excellent at memorizing surfac...
research
04/27/2023

ChatLog: Recording and Analyzing ChatGPT Across Time

While there are abundant researches about evaluating ChatGPT on natural ...
research
10/05/2020

Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehension with Reinforcement Learning

Interactive Fiction (IF) games with real human-written natural language ...

Please sign up or login with your details

Forgot password? Click here to reset