Language Model Behavior: A Comprehensive Survey

03/20/2023
by   Tyler A. Chang, et al.
0

Transformer language models have received widespread public attention, yet their generated text is often surprising even to NLP researchers. In this survey, we discuss over 250 recent studies of English language model behavior before task-specific fine-tuning. Language models possess basic capabilities in syntax, semantics, pragmatics, world knowledge, and reasoning, but these capabilities are sensitive to specific inputs and surface features. Despite dramatic increases in generated text quality as models scale to hundreds of billions of parameters, the models are still prone to unfactual responses, commonsense errors, memorized text, and social biases. Many of these weaknesses can be framed as over-generalizations or under-generalizations of learned patterns in text. We synthesize recent results to highlight what is currently known about what large language models can and cannot do.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/01/2021

WARP: Word-level Adversarial ReProgramming

Transfer learning from pretrained language models recently became the do...
research
03/06/2023

Spelling convention sensitivity in neural language models

We examine whether large neural language models, trained on very large c...
research
06/13/2023

Questioning the Survey Responses of Large Language Models

As large language models increase in capability, researchers have starte...
research
04/02/2023

Eight Things to Know about Large Language Models

The widespread public deployment of large language models (LLMs) in rece...
research
01/02/2021

On-the-Fly Attention Modularization for Neural Generation

Despite considerable advancements with deep neural language models (LMs)...
research
05/23/2023

GenSpectrum Chat: Data Exploration in Public Health Using Large Language Models

Introduction: The COVID-19 pandemic highlighted the importance of making...
research
08/21/2023

Can Language Models Learn to Listen?

We present a framework for generating appropriate facial responses from ...

Please sign up or login with your details

Forgot password? Click here to reset