Rewarding Chatbots for Real-World Engagement with Millions of Users

03/10/2023
by Robert Irvine, et al.

The emergence of pretrained large language models has led to the deployment of a range of social chatbots for chitchat. Although these chatbots demonstrate language ability and fluency, they are not guaranteed to be engaging and can struggle to retain users. This work investigates the development of social chatbots that prioritize user engagement to enhance retention, specifically examining the use of human feedback to efficiently develop highly engaging chatbots. The proposed approach uses automatic pseudo-labels collected from user interactions to train a reward model, which is then used at inference time to reject low-scoring sample responses generated by the chatbot model. Intuitive evaluation metrics, such as mean conversation length (MCL), are introduced as proxies to measure the level of engagement of deployed chatbots. A/B testing on groups of 10,000 new daily chatbot users on the Chai Research platform shows that this approach increases the MCL by up to 70%, which translates to a more than 30% increase in user retention. Future work aims to use the reward model to realise a data fly-wheel, where the latest user conversations can be used to alternately fine-tune the language model and the reward model.
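As a rough illustration of the two ideas in the abstract, the sketch below shows inference-time rejection of low-scoring samples (keeping only the best of several candidate responses) and a simple mean conversation length calculation. It is not the authors' implementation: `best_of_n`, `mean_conversation_length`, and `toy_reward` are hypothetical names, the toy scoring rule merely stands in for a trained reward model, and counting messages per conversation is an assumed definition of MCL.

```python
from statistics import mean
from typing import Callable, Sequence


def best_of_n(candidates: Sequence[str], reward: Callable[[str], float]) -> str:
    """Keep the highest-scoring candidate response and reject the rest."""
    if not candidates:
        raise ValueError("need at least one candidate response")
    return max(candidates, key=reward)


def mean_conversation_length(conversations: Sequence[Sequence[str]]) -> float:
    """Assumed MCL definition: average number of messages per conversation."""
    return mean(len(conversation) for conversation in conversations)


def toy_reward(response: str) -> float:
    # Hypothetical stand-in for a trained reward model: favours longer
    # replies and gives a bonus to replies that ask the user a question.
    return len(response.split()) + (5.0 if "?" in response else 0.0)


# Candidate responses would normally be sampled from the chatbot model.
candidates = ["ok.", "That sounds fun! What did you do next?", "I see."]
chosen = best_of_n(candidates, toy_reward)

# Two toy conversations with 2 and 4 messages respectively.
mcl = mean_conversation_length([["hi", "bye"], ["hi", "how are you", "nice", "bye"]])
```

In a deployed system the reward function would be a model trained on the pseudo-labels described above, and the conversations would come from logged user sessions in each A/B test group.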

