Attention Is (not) All You Need for Commonsense Reasoning

05/31/2019
by   Tassilo Klein, et al.
1

The recently introduced BERT model exhibits strong performance on several language understanding benchmarks. In this paper, we describe a simple re-implementation of BERT for commonsense reasoning. We show that the attentions produced by BERT can be directly utilized for tasks such as the Pronoun Disambiguation Problem and Winograd Schema Challenge. Our proposed attention-guided commonsense reasoning method is conceptually simple yet empirically powerful. Experimental analysis on multiple datasets demonstrates that our proposed system performs remarkably well on all cases while outperforming the previously reported state of the art by a margin. While results suggest that BERT seems to implicitly learn to establish complex relationships between entities, solving commonsense reasoning tasks might require more than unsupervised models learned from huge text corpora.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/02/2020

Contrastive Self-Supervised Learning for Commonsense Reasoning

We propose a self-supervised method to solve Pronoun Disambiguation and ...
research
10/01/2018

A Simple Machine Learning Method for Commonsense Reasoning? A Short Commentary on Trinh & Le (2018)

This is a short Commentary on Trinh & Le (2018) ("A Simple Method for Co...
research
12/13/2020

KVL-BERT: Knowledge Enhanced Visual-and-Linguistic BERT for Visual Commonsense Reasoning

Reasoning is a critical ability towards complete visual understanding. T...
research
09/28/2020

Knowledge-Aware Procedural Text Understanding with Multi-Stage Training

We focus on the task of procedural text understanding, which aims to tra...
research
10/02/2019

Cracking the Contextual Commonsense Code: Understanding Commonsense Reasoning Aptitude of Deep Contextual Representations

Pretrained deep contextual representations have advanced the state-of-th...
research
05/15/2019

A Surprisingly Robust Trick for Winograd Schema Challenge

The Winograd Schema Challenge (WSC) dataset WSC273 and its inference cou...
research
06/07/2018

A Simple Method for Commonsense Reasoning

Commonsense reasoning is a long-standing challenge for deep learning. Fo...

Please sign up or login with your details

Forgot password? Click here to reset