Modular Representation Underlies Systematic Generalization in Neural Natural Language Inference Models

04/30/2020
by   Atticus Geiger, et al.
0

In adversarial (challenge) testing, we pose hard generalization tasks in order to gain insights into the solutions found by our models. What properties must a system have in order to succeed at these hard tasks? In this paper, we argue that an essential factor is the ability to form modular representations. Our central contribution is a definition of what it means for a representation to be modular and an experimental method for assessing the extent to which a system's solution is modular in this general sense. Our work is grounded empirically in a new challenge Natural Language Inference dataset designed to assess systems on their ability to reason about entailment and negation. We find that a BERT model with fine-tuning is strikingly successful at the hard generalization tasks we pose using this dataset, and our active manipulations help us to understand why: despite the densely interconnected nature of the BERT architecture, the learned model embeds modular, general theories of lexical entailment relations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/07/2020

Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition

Natural language inference (NLI) is an increasingly important task for n...
research
09/14/2021

On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning

Recent work has shown evidence that the knowledge acquired by multilingu...
research
08/21/2015

A large annotated corpus for learning natural language inference

Understanding entailment and contradiction is fundamental to understandi...
research
12/02/2014

Tiered Clustering to Improve Lexical Entailment

Many tasks in Natural Language Processing involve recognizing lexical en...
research
09/11/2020

Systematic Generalization on gSCAN with Language Conditioned Embedding

Systematic Generalization refers to a learning algorithm's ability to ex...
research
08/07/2023

Towards Controllable Natural Language Inference through Lexical Inference Types

Explainable natural language inference aims to provide a mechanism to pr...
research
12/21/2021

Provable Hierarchical Lifelong Learning with a Sketch-based Modular Architecture

We propose a modular architecture for the lifelong learning of hierarchi...

Please sign up or login with your details

Forgot password? Click here to reset