A Simple Explanation for the Phase Transition in Large Language Models with List Decoding

03/23/2023
by   Cheng-Shang Chang, et al.
0

Various recent experimental results show that large language models (LLM) exhibit emergent abilities that are not present in small models. System performance is greatly improved after passing a certain critical threshold of scale. In this letter, we provide a simple explanation for such a phase transition phenomenon. For this, we model an LLM as a sequence-to-sequence random function. Instead of using instant generation at each step, we use a list decoder that keeps a list of candidate sequences at each step and defers the generation of the output sequence at the end. We show that there is a critical threshold such that the expected number of erroneous candidate sequences remains bounded when an LLM is below the threshold, and it grows exponentially when an LLM is above the threshold. Such a threshold is related to the basic reproduction number in a contagious disease.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2019

Comparison of Diverse Decoding Methods from Conditional Language Models

While conditional language models have greatly improved in their ability...
research
11/13/2019

The Number of Threshold Words on n Letters Grows Exponentially for Every n≥ 27

For every n≥ 27, we show that the number of n/(n-1)^+-free words (i.e., ...
research
08/08/2023

Learning Evaluation Models from Large Language Models for Sequence Generation

Large language models achieve state-of-the-art performance on sequence g...
research
05/25/2021

Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization

The Lottery Ticket Hypothesis suggests that an over-parametrized network...
research
03/07/2014

On the Sequence of State Configurations in the Garden of Eden

Autonomous threshold element circuit networks are used to investigate th...
research
05/27/2019

Statistical Learning Aided List Decoding of Semi-Random Block Oriented Convolutional Codes

In this paper, we propose a statistical learning aided list decoding alg...
research
02/15/2018

List Heaps

This paper presents a simple extension of the binary heap, the List Heap...

Please sign up or login with your details

Forgot password? Click here to reset