The Rediscovery Hypothesis: Language Models Need to Meet Linguistics

03/02/2021
by   Vassilina Nikoulina, et al.
0

There is an ongoing debate in the NLP community whether modern language models contain linguistic knowledge, recovered through so-called probes. In this paper we study whether linguistic knowledge is a necessary condition for good performance of modern language models, which we call the rediscovery hypothesis. In the first place we show that language models that are significantly compressed but perform well on their pretraining objectives retain good scores when probed for linguistic structures. This result supports the rediscovery hypothesis and leads to the second contribution of our paper: an information-theoretic framework that relates language modeling objective with linguistic information. This framework also provides a metric to measure the impact of linguistic information on the word prediction task. We reinforce our analytical results with various experiments, both on synthetic and on real tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/03/2022

Overcoming Barriers to Skill Injection in Language Modeling: Case Study in Arithmetic

Through their transfer learning abilities, highly-parameterized large pr...
research
02/06/2021

Does He Wink or Does He Nod? A Challenging Benchmark for Evaluating Word Understanding of Language Models

Recent progress in pretraining language models on large corpora has resu...
research
06/04/2019

The Unreasonable Effectiveness of Transformer Language Models in Grammatical Error Correction

Recent work on Grammatical Error Correction (GEC) has highlighted the im...
research
09/20/2023

Exploring the Relationship between LLM Hallucinations and Prompt Linguistic Nuances: Readability, Formality, and Concreteness

As Large Language Models (LLMs) have advanced, they have brought forth n...
research
05/22/2023

Prompt-based methods may underestimate large language models' linguistic generalizations

Prompting is now a dominant method for evaluating the linguistic knowled...
research
05/20/2023

Revisiting Entropy Rate Constancy in Text

The uniform information density (UID) hypothesis states that humans tend...
research
09/23/2021

Revisiting the Uniform Information Density Hypothesis

The uniform information density (UID) hypothesis posits a preference amo...

Please sign up or login with your details

Forgot password? Click here to reset