A Knowledge Hunting Framework for Common Sense Reasoning

10/02/2018
by   Ali Emami, et al.
0

We introduce an automatic system that achieves state-of-the-art results on the Winograd Schema Challenge (WSC), a common sense reasoning task that requires diverse, complex forms of inference and knowledge. Our method uses a knowledge hunting module to gather text from the web, which serves as evidence for candidate problem resolutions. Given an input problem, our system generates relevant queries to send to a search engine, then extracts and classifies knowledge from the returned results and weighs them to make a resolution. Our approach improves F1 performance on the full WSC by 0.21 over the previous best and represents the first system to exceed 0.5 F1. We further demonstrate that the approach is competitive on the Choice of Plausible Alternatives (COPA) task, which suggests that it is generally applicable.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/05/2018

On the Evaluation of Common-Sense Reasoning in Natural Language Understanding

The NLP and ML communities have long been interested in developing model...
research
01/14/2018

Top k Memory Candidates in Memory Networks for Common Sense Reasoning

Successful completion of reasoning task requires the agent to have relev...
research
01/08/2018

Winograd Schema - Knowledge Extraction Using Narrative Chains

The Winograd Schema Challenge (WSC) is a test of machine intelligence, d...
research
08/26/2019

Improving Neural Story Generation by Targeted Common Sense Grounding

Stories generated with neural language models have shown promise in gram...
research
01/06/2023

Witscript 3: A Hybrid AI System for Improvising Jokes in a Conversation

Previous papers presented Witscript and Witscript 2, AI systems for impr...
research
03/14/2023

ViperGPT: Visual Inference via Python Execution for Reasoning

Answering visual queries is a complex task that requires both visual pro...
research
11/02/2018

The Hard-CoRe Coreference Corpus: Removing Gender and Number Cues for Difficult Pronominal Anaphora Resolution

We introduce a new benchmark task for coreference resolution, Hard-CoRe,...

Please sign up or login with your details

Forgot password? Click here to reset