TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents

01/23/2019
by Thomas Wolf, et al.

We introduce TransferTransfo, a new approach to generative data-driven dialogue systems (e.g. chatbots) that combines a transfer-learning-based training scheme with a high-capacity Transformer model. Fine-tuning uses a multi-task objective that combines several unsupervised prediction tasks. The resulting fine-tuned model shows strong improvements over current state-of-the-art end-to-end conversational models such as memory-augmented seq2seq and information-retrieval models. On the privately held PERSONA-CHAT dataset of the Conversational Intelligence Challenge 2, this approach obtains a new state of the art, with respective perplexity, Hits@1 and F1 metrics of 16.28 (45 improvement) and 19.5 (20

research
10/12/2021

Småprat: DialoGPT for Natural Language Generation of Swedish Dialogue by Transfer Learning

Building open-domain conversational systems (or chatbots) that produce c...
research
10/15/2021

Few-Shot Bot: Prompt-Based Learning for Dialogue Systems

Learning to converse using only a few examples is a great challenge in c...
research
04/22/2022

Detecting early signs of depression in the conversational domain: The role of transfer learning in low-resource scenarios

The high prevalence of depression in society has given rise to the need ...
research
12/02/2020

Interactive Teaching for Conversational AI

Current conversational AI systems aim to understand a set of pre-designe...
research
02/21/2019

Incremental Transfer Learning in Two-pass Information Bottleneck based Speaker Diarization System for Meetings

The two-pass information bottleneck (TPIB) based speaker diarization sys...
research
05/08/2023

Multi-Task End-to-End Training Improves Conversational Recommendation

In this paper, we analyze the performance of a multitask end-to-end tran...
research
12/28/2019

All-in-One Image-Grounded Conversational Agents

As single-task accuracy on individual language and image tasks has impro...
