Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

08/06/2023
by Liangming Pan, et al.

Large language models (LLMs) have demonstrated remarkable performance across a wide array of NLP tasks. However, their efficacy is undermined by undesired and inconsistent behaviors, including hallucination, unfaithful reasoning, and toxic content. A promising approach to rectify these flaws is self-correction, where the LLM itself is prompted or guided to fix problems in its own output. Techniques that leverage automated feedback, produced either by the LLM itself or by some external system, are of particular interest because they could make LLM-based solutions more practical and deployable with minimal human feedback. This paper presents a comprehensive review of this emerging class of techniques. We analyze and taxonomize a wide array of recent work utilizing these strategies, including training-time, generation-time, and post-hoc correction. We also summarize the major applications of this strategy and conclude by discussing future directions and challenges.


research
08/03/2023

Does Correction Remain A Problem For Large Language Models?

As large language models, such as GPT, continue to advance the capabilit...
research
06/04/2019

The Unreasonable Effectiveness of Transformer Language Models in Grammatical Error Correction

Recent work on Grammatical Error Correction (GEC) has highlighted the im...
research
02/15/2023

The Capacity for Moral Self-Correction in Large Language Models

We test the hypothesis that language models trained with reinforcement l...
research
03/06/2022

Leashing the Inner Demons: Self-Detoxification for Language Models

Language models (LMs) can reproduce (or amplify) toxic language seen dur...
research
09/06/2023

Zero-Resource Hallucination Prevention for Large Language Models

The prevalent use of large language models (LLMs) in various domains has...
research
05/18/2023

Generalized Planning in PDDL Domains with Pretrained Large Language Models

Recent work has considered whether large language models (LLMs) can func...
research
06/03/2023

Guided scenarios with simulated expert personae: a remarkable strategy to perform cognitive work

Large language models (LLMs) trained on a substantial corpus of human kn...
