Unlocking the Potential of User Feedback: Leveraging Large Language Model as User Simulator to Enhance Dialogue System

06/16/2023
by   Zhiyuan Hu, et al.
0

Dialogue systems and large language models (LLMs) have gained considerable attention. However, the direct utilization of LLMs as task-oriented dialogue (TOD) models has been found to underperform compared to smaller task-specific models. Nonetheless, it is crucial to acknowledge the significant potential of LLMs and explore improved approaches for leveraging their impressive abilities. Motivated by the goal of leveraging LLMs, we propose an alternative approach called User-Guided Response Optimization (UGRO) to combine it with a smaller TOD model. This approach uses LLM as annotation-free user simulator to assess dialogue responses, combining them with smaller fine-tuned end-to-end TOD models. By utilizing the satisfaction feedback generated by LLMs, UGRO further optimizes the supervised fine-tuned TOD model. Specifically, the TOD model takes the dialogue history as input and, with the assistance of the user simulator's feedback, generates high-satisfaction responses that meet the user's requirements. Through empirical experiments on two TOD benchmarks, we validate the effectiveness of our method. The results demonstrate that our approach outperforms previous state-of-the-art (SOTA) results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2023

Enhancing Large Language Model Induced Task-Oriented Dialogue Systems Through Look-Forward Motivated Goals

Recently, the development of large language models (LLMs) has been signi...
research
12/31/2020

Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion Dialogues via Reinforcement Learning and Human Demonstration

Despite the recent success of large-scale language models on various dow...
research
11/11/2018

User Modeling for Task Oriented Dialogues

We introduce end-to-end neural network based models for simulating users...
research
09/10/2019

A Corpus-free State2Seq User Simulator for Task-oriented Dialogue

Recent reinforcement learning algorithms for task-oriented dialogue syst...
research
12/03/2020

Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries

Despite end-to-end neural systems making significant progress in the las...
research
06/02/2023

EmoUS: Simulating User Emotions in Task-Oriented Dialogues

Existing user simulators (USs) for task-oriented dialogue systems only m...
research
05/04/2023

ChatGPT-steered Editing Instructor for Customization of Abstractive Summarization

Tailoring outputs of large language models, such as ChatGPT, to specific...

Please sign up or login with your details

Forgot password? Click here to reset