Chain-of-Verification Reduces Hallucination in Large Language Models

09/20/2023
by Shehzaad Dhuliawala, et al.

Generation of plausible yet incorrect factual information, termed hallucination, is an unsolved issue in large language models. We study the ability of language models to deliberate on the responses they give in order to correct their mistakes. We develop the Chain-of-Verification (CoVe) method, whereby the model (i) drafts an initial response; (ii) plans verification questions to fact-check its draft; (iii) answers those questions independently, so the answers are not biased by other responses; and (iv) generates its final verified response. In experiments, we show that CoVe decreases hallucinations across a variety of tasks, including list-based questions from Wikidata, closed-book MultiSpanQA, and longform text generation.
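
The four steps described in the abstract can be sketched as a short pipeline. This is a minimal illustration, not the authors' implementation: the function name `chain_of_verification`, the `LLM` callable type, and all prompt wordings are assumptions made for this example. Any function that maps a prompt string to a completion string can be passed in as `llm`.

```python
from typing import Callable, Dict, List, Union

# Hypothetical interface: any function mapping a prompt string to a completion string.
LLM = Callable[[str], str]


def chain_of_verification(query: str, llm: LLM) -> Dict[str, Union[str, List[str]]]:
    """Run a four-step CoVe-style loop. Prompt texts are illustrative assumptions."""
    # (i) Draft an initial response.
    draft = llm(f"Answer the question.\nQuestion: {query}")

    # (ii) Plan verification questions that fact-check the draft.
    plan = llm(
        "List verification questions, one per line, that fact-check this answer.\n"
        f"Question: {query}\nAnswer: {draft}"
    )
    questions = [line.strip() for line in plan.splitlines() if line.strip()]

    # (iii) Answer each question independently: each prompt contains only the
    # verification question, not the draft or the other answers.
    answers = [llm(f"Answer concisely.\nQuestion: {q}") for q in questions]

    # (iv) Generate the final response, conditioned on the verification results.
    evidence = "\n".join(f"Q: {q}\nA: {a}" for q, a in zip(questions, answers))
    final = llm(
        "Revise the draft so it is consistent with the verification results.\n"
        f"Question: {query}\nDraft: {draft}\nVerification:\n{evidence}"
    )
    return {"draft": draft, "questions": questions, "answers": answers, "final": final}
```

The design choice that matters in this sketch is step (iii): the draft is deliberately left out of each verification prompt, which is one way to keep the answers from being biased by the initial response.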


research
07/13/2023

Negated Complementary Commonsense using Large Language Models

Larger language models, such as GPT-3, have shown to be excellent in man...
research
05/30/2023

Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard

A comparison between three chatbots which are based on large language mo...
research
02/13/2023

"Correct answers" from the psychology of artificial intelligence

Large Language Models have vastly grown in capabilities. One proposed ap...
research
04/21/2023

Who's the Best Detective? LLMs vs. MLs in Detecting Incoherent Fourth Grade Math Answers

Written answers to open-ended questions can have a higher long-term effe...
research
04/06/2023

ChatGPT-Crawler: Find out if ChatGPT really knows what it's talking about

Large language models have gained considerable interest for their impres...
research
08/04/2022

N-best Response-based Analysis of Contradiction-awareness in Neural Response Generation Models

Avoiding the generation of responses that contradict the preceding conte...
research
05/23/2023

Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models

This paper investigates the capabilities of Large Language Models (LLMs)...
