On Task-Adaptive Pretraining for Dialogue Response Selection

10/08/2022
by Tzu-Hsiang Lin, et al.

Recent advancements in dialogue response selection (DRS) are based on the task-adaptive pre-training (TAP) approach: models are first initialized with BERT <cit.> and then adapted to dialogue data with dialogue-specific or fine-grained pre-training tasks. However, it is uncertain whether BERT is the best initialization choice, or whether the proposed dialogue-specific fine-grained learning tasks are actually better than MLM+NSP. This paper aims to verify assumptions made in previous works and to understand the source of improvements for DRS. We show that initializing with RoBERTa achieves similar performance to BERT, and that MLM+NSP can outperform all previously proposed TAP tasks, during which we also contribute a new state-of-the-art on the Ubuntu corpus. Additional analyses show that the main source of improvement is the TAP step, and that the NSP task is crucial to DRS, unlike in common NLU tasks.
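The TAP recipe evaluated here is, in essence, continued pre-training of a BERT-style encoder on dialogue data with the original MLM and NSP objectives, where NSP pairs a dialogue context with either the true next response or a sampled negative. Below is a minimal sketch of that setup, assuming the Hugging Face `transformers` library; the masking policy (no 80/10/10 split), example utterances, and hyperparameters are illustrative simplifications, not the authors' exact configuration.

```python
# Sketch of task-adaptive pre-training (TAP) with MLM+NSP on dialogue data.
# Assumes Hugging Face `transformers`; values below are illustrative only.
import random
import torch
from transformers import BertTokenizerFast, BertForPreTraining

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForPreTraining.from_pretrained("bert-base-uncased")  # MLM + NSP heads

def make_example(context, response, negative_pool):
    """Pair a dialogue context with its true response (NSP label 0)
    or a random negative (NSP label 1), then mask ~15% of tokens for MLM."""
    if random.random() < 0.5:
        nsp_label, resp = 0, response                      # true next utterance
    else:
        nsp_label, resp = 1, random.choice(negative_pool)  # sampled negative
    enc = tokenizer(context, resp, truncation=True, max_length=256,
                    return_tensors="pt")
    labels = enc["input_ids"].clone()
    # Build a mask over non-special tokens for the MLM objective.
    special = torch.tensor(tokenizer.get_special_tokens_mask(
        labels[0].tolist(), already_has_special_tokens=True), dtype=torch.bool)
    mask = (torch.rand(labels.shape) < 0.15) & ~special.unsqueeze(0)
    if not mask.any():                                     # ensure >=1 MLM target
        mask[0, (~special).nonzero()[0]] = True
    labels[~mask] = -100                                   # ignore unmasked positions
    enc["input_ids"][mask] = tokenizer.mask_token_id       # simplified: always [MASK]
    enc["labels"] = labels
    enc["next_sentence_label"] = torch.tensor([nsp_label])
    return enc

batch = make_example("hi , how do i upgrade ubuntu ?",
                     "run sudo apt-get dist-upgrade",
                     ["reboot usually fixes it", "try a different mirror"])
loss = model(**batch).loss   # joint MLM + NSP loss, backpropagated during TAP
```

After TAP, the adapted encoder would be fine-tuned on the downstream response-selection task; the paper's finding is that this MLM+NSP adaptation step, rather than any dialogue-specific objective, accounts for most of the gains.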

