Make The Most of Prior Data: A Solution for Interactive Text Summarization with Preference Feedback

04/12/2022
by Duy Hung Nguyen, et al.

For summarization, human preference is critical for steering the summarizer's outputs toward human interests, as ground-truth summaries are scarce and ambiguous. Practical settings require dynamic exchanges between a human and an AI agent, in which feedback is provided online, a few pieces at a time. In this paper, we introduce a new framework for interactively training summarization models with preference feedback. By properly leveraging offline data and a novel reward model, we improve performance in terms of ROUGE scores and sample efficiency. Our experiments on three different datasets confirm the benefit of the proposed framework in active, few-shot, and online settings of preference learning.

research
07/10/2023

Improving Factuality of Abstractive Summarization via Contrastive Reward Learning

Modern abstractive summarization models often generate summaries that co...
research
05/06/2021

Hone as You Read: A Practical Type of Interactive Summarization

We present HARE, a new task where reader feedback is used to optimize do...
research
12/15/2022

Constitutional AI: Harmlessness from AI Feedback

As AI systems become more capable, we would like to enlist their help to...
research
09/01/2023

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Reinforcement learning from human feedback (RLHF) is effective at aligni...
research
04/12/2022

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

We apply preference modeling and reinforcement learning from human feedb...
research
03/01/2021

A Machine Learning Approach for Predicting Human Preference for Graph Layouts

Understanding what graph layout human prefer and why they prefer is sign...
research
02/27/2023

Active Reward Learning from Online Preferences

Robot policies need to adapt to human preferences and/or new environment...
