TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog

10/21/2020
by Erik Ekstedt, et al.

Syntactic and pragmatic completeness is known to be important for turn-taking prediction, but so far machine learning models of turn-taking have used such linguistic information in a limited way. In this paper, we introduce TurnGPT, a transformer-based language model for predicting turn-shifts in spoken dialog. The model has been trained and evaluated on a variety of written and spoken dialog datasets. We show that the model outperforms two baselines used in prior work. We also report on an ablation study, as well as attention and gradient analyses, which show that the model is able to utilize the dialog context and pragmatic completeness for turn-taking prediction. Finally, we explore the model's potential in not only detecting, but also projecting, turn-completions.
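The abstract does not spell out how a language model yields turn-shift predictions, so the following is only an illustrative sketch, not the authors' implementation. It assumes that turn boundaries are marked with a special token (hypothetically named `<ts>` here) and that the probability the model assigns to that token after a given context is read as the turn-shift probability. A toy bigram counter stands in for the transformer; the function names and the example dialogs are made up for illustration.

```python
from collections import Counter, defaultdict

TS = "<ts>"  # hypothetical turn-shift token; an assumption of this sketch


def train_bigram(dialogs):
    """Count bigrams over dialogs flattened into one token stream,
    with TS appended after every turn."""
    counts = defaultdict(Counter)
    for turns in dialogs:
        tokens = []
        for turn in turns:
            tokens.extend(turn.lower().split())
            tokens.append(TS)
        for prev, nxt in zip(tokens, tokens[1:]):
            counts[prev][nxt] += 1
    return counts


def turn_shift_prob(counts, prev_word):
    """Estimate P(TS | prev_word) from the bigram counts.
    Returns 0.0 for words never seen in training."""
    following = counts.get(prev_word.lower())
    if not following:
        return 0.0
    return following[TS] / sum(following.values())


# Made-up toy data: each dialog is a list of turns.
dialogs = [
    ["how are you", "i am fine", "good to hear"],
    ["are you coming", "yes i am"],
]
counts = train_bigram(dialogs)

for word in ["fine", "you", "are"]:
    print(word, turn_shift_prob(counts, word))
```

A bigram model only sees the previous word, so it cannot use the wider dialog context that the abstract highlights; a transformer conditioned on the full history would replace `turn_shift_prob` in a realistic setting.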

research
05/19/2022

Voice Activity Projection: Self-supervised Learning of Turn-taking Events

The modeling of turn-taking in dialog can be viewed as the modeling of t...
research
01/29/2023

Learning Analytics from Spoken Discussion Dialogs in Flipped Classroom

The flipped classroom is a new pedagogical strategy that has been gainin...
research
08/31/2018

Multimodal Continuous Turn-Taking Prediction Using Multiscale RNNs

In human conversational interactions, turn-taking exchanges can be coord...
research
05/28/2019

An Incremental Turn-Taking Model For Task-Oriented Dialog Systems

In a human-machine dialog scenario, deciding the appropriate time for th...
research
05/09/2018

Improving End-of-turn Detection in Spoken Dialogues by Detecting Speaker Intentions as a Secondary Task

This work focuses on the use of acoustic cues for modeling turn-taking i...
research
06/25/2016

Leveraging Semantic Web Search and Browse Sessions for Multi-Turn Spoken Dialog Systems

Training statistical dialog models in spoken dialog systems (SDS) requir...
research
06/29/2018

Investigating Speech Features for Continuous Turn-Taking Prediction Using LSTMs

For spoken dialog systems to conduct fluid conversational interactions w...
