A neural prosody encoder for end-to-end dialogue act classification

05/11/2022
by Kai Wei, et al.

Dialogue act classification (DAC) is a critical task for spoken language understanding in dialogue systems. Prosodic features such as energy and pitch have been shown to be useful for DAC. Despite their importance, little research has explored neural approaches for integrating prosodic features into end-to-end (E2E) DAC models, which infer dialogue acts directly from audio signals. In this work, we propose an E2E neural architecture that accounts for the need to characterize prosodic phenomena co-occurring at different levels inside an utterance. A novel part of this architecture is a learnable gating mechanism that assesses the importance of prosodic features and selectively retains core information necessary for E2E DAC. Our proposed model improves DAC accuracy by 1.07
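The abstract does not spell out how the gating mechanism is formulated, so the following is only a minimal sketch of one common way to gate a feature vector, not the authors' model. It assumes an element-wise sigmoid gate computed from the concatenation of an acoustic vector and a prosodic vector; the class name `ProsodyGate`, the dimensions, and the random weights (which would be learned in a real model) are all hypothetical.

```python
import math
import random


def sigmoid(x: float) -> float:
    """Logistic function, mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


class ProsodyGate:
    """Hypothetical element-wise gate over a prosodic feature vector.

    Assumption: gate = sigmoid(W [acoustic; prosody] + b), and the gated
    output is gate * prosody. This is an illustrative formulation only.
    """

    def __init__(self, acoustic_dim: int, prosody_dim: int, seed: int = 0) -> None:
        rng = random.Random(seed)
        in_dim = acoustic_dim + prosody_dim
        # One weight row per prosodic dimension; random here, trained in practice.
        self.weights = [
            [rng.uniform(-0.1, 0.1) for _ in range(in_dim)]
            for _ in range(prosody_dim)
        ]
        self.bias = [0.0] * prosody_dim

    def __call__(self, acoustic, prosody):
        # The gate sees both streams, so it can weigh prosody in context.
        joint = list(acoustic) + list(prosody)
        gates = [
            sigmoid(sum(w * x for w, x in zip(row, joint)) + b)
            for row, b in zip(self.weights, self.bias)
        ]
        # Each prosodic feature is scaled by its gate value in (0, 1).
        gated = [g * p for g, p in zip(gates, prosody)]
        return gates, gated


# Toy inputs: a 4-dimensional acoustic vector and two prosodic features
# (for example, energy and pitch); the values are made up for illustration.
prosody = [1.5, -0.7]
gate = ProsodyGate(acoustic_dim=4, prosody_dim=2)
gates, gated = gate([0.2, -0.1, 0.5, 0.3], prosody)
```

Because each gate value lies strictly between 0 and 1, a gate of this form can only attenuate a prosodic feature, never amplify it, which is one simple way to read "selectively retains" in the abstract.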

research
05/16/2018

A Context-based Approach for Dialogue Act Recognition using Simple Recurrent Neural Networks

Dialogue act recognition is an important part of natural language unders...
research
12/13/2021

Attentive Contextual Carryover for Multi-Turn End-to-End Spoken Language Understanding

Recent years have seen significant advances in end-to-end (E2E) spoken l...
research
05/04/2023

End-to-end spoken language understanding using joint CTC loss and self-supervised, pretrained acoustic encoders

It is challenging to extract semantic meanings directly from audio signa...
research
10/28/2020

Handling Class Imbalance in Low-Resource Dialogue Systems by Combining Few-Shot Classification and Interpolation

Utterance classification performance in low-resource dialogue systems is...
research
04/30/2020

Hierarchical Encoders for Modeling and Interpreting Screenplays

While natural language understanding of long-form documents is still an ...
research
05/14/2023

Improving End-to-End SLU performance with Prosodic Attention and Distillation

Most End-to-End SLU methods depend on the pretrained ASR or language mod...
research
11/15/2017

Dialogue Act Recognition via CRF-Attentive Structured Network

Dialogue Act Recognition (DAR) is a challenging problem in dialogue inte...
