User-Controlled Knowledge Fusion in Large Language Models: Balancing Creativity and Hallucination

07/30/2023
by Chen Zhang, et al.

In modern dialogue systems, the use of Large Language Models (LLMs) has grown rapidly because they can generate diverse, relevant, and creative responses. Despite these strengths, balancing an LLM's creativity against its faithfulness to external knowledge remains a key challenge. This paper presents a user-controllable mechanism that modulates the balance between an LLM's imaginative capabilities and its adherence to factual information. Our approach adds a numerical tag during fine-tuning that represents how faithful each generated response is to the reference knowledge. This degree is computed automatically from three signals: lexical overlap measured with ROUGE scores, semantic similarity measured with Sentence-BERT embeddings, and a self-evaluation score produced by an LLM. At inference time, users can change this numerical tag and thereby control how strongly the LLM relies on external knowledge. We conduct extensive experiments across various scenarios, demonstrating the adaptability of our method and its efficacy in ensuring the quality and accuracy of the LLM's responses. The results highlight the potential of our approach to enhance the versatility of LLMs while maintaining a balance between creativity and hallucination.
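
The tagging idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several parts are assumptions: the abstract does not say how the three signals are combined, so equal weights are used here; the lexical score is a simplified unigram-overlap F1 standing in for ROUGE; the semantic-similarity and self-evaluation scores are passed in as precomputed numbers in [0, 1] rather than obtained from Sentence-BERT or an LLM; and the names `faithfulness_degree`, `to_tag`, `build_prompt`, and the `[faithfulness=N]` tag format are hypothetical.

```python
from collections import Counter


def rouge1_f1(reference: str, response: str) -> float:
    """Unigram-overlap F1: a simplified stand-in for ROUGE-1
    (lowercased, whitespace tokenization, no stemming)."""
    ref = Counter(reference.lower().split())
    hyp = Counter(response.lower().split())
    overlap = sum((ref & hyp).values())  # clipped count of shared unigrams
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def faithfulness_degree(lexical: float, semantic: float, self_eval: float,
                        weights=(1.0, 1.0, 1.0)) -> float:
    """Weighted mean of the three signals. Equal weighting is an
    assumption; the abstract does not specify the combination rule."""
    scores = (lexical, semantic, self_eval)
    if any(not 0.0 <= s <= 1.0 for s in scores):
        raise ValueError("all scores must lie in [0, 1]")
    return sum(w * s for w, s in zip(weights, scores)) / sum(weights)


def to_tag(degree: float, levels: int = 10) -> int:
    """Map a degree in [0, 1] to an integer tag in 0..levels
    (the number of levels is a hypothetical choice)."""
    return min(levels, max(0, round(degree * levels)))


def build_prompt(tag: int, knowledge: str, query: str) -> str:
    """Prepend the tag to the model input (hypothetical format).
    In training the tag would come from to_tag; at inference the
    user would pick it directly."""
    return f"[faithfulness={tag}] knowledge: {knowledge}\nuser: {query}"


if __name__ == "__main__":
    knowledge = "The Eiffel Tower is in Paris"
    response = "The Eiffel Tower is located in Paris"
    lexical = rouge1_f1(knowledge, response)
    # semantic and self_eval are placeholder values, not real model outputs
    degree = faithfulness_degree(lexical, semantic=0.9, self_eval=0.8)
    print(build_prompt(to_tag(degree), knowledge, "Where is the Eiffel Tower?"))
```

Under these assumptions, a user who wants a response grounded closely in the reference knowledge would pass a high tag to `build_prompt`, and a user who wants a freer response would pass a low one.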


research
01/21/2022

A Comparative Study on Language Models for Task-Oriented Dialogue Systems

The recent development of language models has shown promising results by...
research
11/18/2020

Predicting metrical patterns in Spanish poetry with language models

In this paper, we compare automated metrical pattern identification syst...
research
10/09/2020

Plug-and-Play Conversational Models

There has been considerable progress made towards conversational models ...
research
05/29/2023

Do Large Language Models Know What They Don't Know?

Large language models (LLMs) have a wealth of knowledge that allows them...
research
04/30/2020

Unsupervised Injection of Knowledge into Dialogue Generation via Language Models

Neural conversation models have shown the power to produce more meaningf...
research
05/24/2023

Evaluate What You Can't Evaluate: Unassessable Generated Responses Quality

LLMs (large language models) such as ChatGPT have shown remarkable langu...
research
08/03/2023

Improving Requirements Completeness: Automated Assistance through Large Language Models

Natural language (NL) is arguably the most prevalent medium for expressi...
