Exploring Textual and Speech information in Dialogue Act Classification with Speaker Domain Adaptation

10/17/2018
by Xuanli He, et al.

Despite the recent success of Dialogue Act (DA) classification, most prior work focuses on text-based classification with oracle (i.e., human) transcriptions rather than Automatic Speech Recognition (ASR) transcriptions. In spoken dialogue systems, however, the agent has access only to noisy ASR transcriptions, and performance may degrade further under domain shift. In this paper, we explore the effectiveness of using both acoustic and textual signals, with either oracle or ASR transcriptions, and investigate speaker domain adaptation for DA classification. Our multimodal model outperforms the unimodal models, particularly when oracle transcriptions are not available. We also propose an effective method for speaker domain adaptation, which achieves competitive results.

research
04/08/2021

Exploring Machine Speech Chain for Domain Adaptation and Few-Shot Speaker Adaptation

Machine Speech Chain, which integrates both end-to-end (E2E) automatic s...
research
04/23/2020

End-to-end speech-to-dialog-act recognition

Spoken language understanding, which extracts intents and/or semantic co...
research
06/22/2022

A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data

Automatic Speech Recognition (ASR) has been dominated by deep learning-ba...
research
04/16/2019

Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation

Spoken question answering (SQA) is challenging due to complex reasoning ...
research
02/28/2019

Context-aware Neural-based Dialog Act Classification on Automatically Generated Transcriptions

This paper presents our latest investigations on dialog act (DA) classif...
research
04/12/2019

Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition

In general, the performance of automatic speech recognition (ASR) system...
research
12/30/2022

TA-DA: Topic-Aware Domain Adaptation for Scientific Keyphrase Identification and Classification (Student Abstract)

Keyphrase identification and classification is a Natural Language Proces...
