Do Language Models Know When They're Hallucinating References?

05/29/2023
by   Ayush Agrawal, et al.
0

Current state-of-the-art language models (LMs) are notorious for generating text with "hallucinations," a primary example being book and paper references that lack any solid basis in their training data. However, we find that many of these fabrications can be identified using the same LM, using only black-box queries without consulting any external resources. Consistency checks done with direct queries about whether the generated reference title is real (inspired by Kadavath et al. 2022, Lin et al. 2022, Manakul et al. 2023) are compared to consistency checks with indirect queries which ask for ancillary details such as the authors of the work. These consistency checks are found to be partially reliable indicators of whether or not the reference is a hallucination. In particular, we find that LMs in the GPT-series will hallucinate differing authors of hallucinated references when queried in independent sessions, while it will consistently identify authors of real references. This suggests that the hallucination may be more a result of generation techniques than the underlying representation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/24/2022

Language Models that Seek for Knowledge: Modular Search Generation for Dialogue and Prompt Completion

Language models (LMs) have recently been shown to generate more factual ...
research
04/13/2021

Detoxifying Language Models Risks Marginalizing Minority Voices

Language models (LMs) must be both safe and equitable to be responsibly ...
research
01/04/2021

Improving reference mining in patents with BERT

References in patents to scientific literature provide relevant informat...
research
09/24/2019

Do Massively Pretrained Language Models Make Better Storytellers?

Large neural language models trained on massive amounts of text have eme...
research
02/18/2018

Efficient Gradual Typing

Gradual typing combines static and dynamic typing in the same program. O...
research
02/09/2023

A 1.1- / 0.9-nA Temperature-Independent 213- / 565-ppm/^∘C Self-Biased CMOS-Only Current Reference in 65-nm Bulk and 22-nm FDSOI

In many applications, the ability of current references to cope with pro...

Please sign up or login with your details

Forgot password? Click here to reset