Generative Models as a Complex Systems Science: How can we make sense of large language model behavior?

07/31/2023
by   Ari Holtzman, et al.
0

Coaxing out desired behavior from pretrained models, while avoiding undesirable ones, has redefined NLP and is reshaping how we interact with computers. What was once a scientific engineering discipline-in which building blocks are stacked one on top of the other-is arguably already a complex systems science, in which emergent behaviors are sought out to support previously unimagined use cases. Despite the ever increasing number of benchmarks that measure task performance, we lack explanations of what behaviors language models exhibit that allow them to complete these tasks in the first place. We argue for a systematic effort to decompose language model behavior into categories that explain cross-task performance, to guide mechanistic explanations and help future-proof analytic research.

READ FULL TEXT

page 2

page 5

page 11

research
07/14/2022

Beware the Rationalization Trap! When Language Model Explainability Diverges from our Mental Models of Language

Language models learn and represent language differently than humans; th...
research
09/12/2023

Leveraging Large Language Models for Automated Dialogue Analysis

Developing high-performing dialogue systems benefits from the automatic ...
research
10/14/2021

Sparks: Inspiration for Science Writing using Language Models

Large-scale language models are rapidly improving, performing well on a ...
research
09/19/2023

Explaining Agent Behavior with Large Language Models

Intelligent agents such as robots are increasingly deployed in real-worl...
research
05/15/2023

DarkBERT: A Language Model for the Dark Side of the Internet

Recent research has suggested that there are clear differences in the la...
research
12/10/2020

Multi-Sense Language Modelling

The effectiveness of a language model is influenced by its token represe...
research
01/28/2023

Do Orcas Have Semantic Language? Machine Learning to Predict Orca Behaviors Using Partially Labeled Vocalization Data

Orcinus orca (killer whales) exhibit complex calls. They last about a se...

Please sign up or login with your details

Forgot password? Click here to reset