Improving End-to-End Models for Set Prediction in Spoken Language Understanding

01/28/2022
by Hong-Kwang J. Kuo, et al.

The goal of spoken language understanding (SLU) systems is to determine the meaning of the input speech signal, unlike speech recognition, which aims to produce verbatim transcripts. Advances in end-to-end (E2E) speech modeling have made it possible to train solely on semantic entities, which are far cheaper to collect than verbatim transcripts. We focus on this set prediction problem, where entity order is unspecified. Using two classes of E2E models, RNN transducers and attention-based encoder-decoders, we show that these models work best when the training entity sequence is arranged in spoken order. To improve E2E SLU models when the spoken order of entities is unknown, we propose a novel data augmentation technique along with an implicit attention-based alignment method to infer the spoken order. F1 scores increased significantly, by more than 11, outperforming previously reported results.
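
To illustrate what "set prediction" means for scoring, the following is a minimal sketch of an order-independent F1 computed over sets of entities. The `(slot, value)` tuple representation, the function name `set_f1`, the example slot names, and the choice of micro-averaging are assumptions made for illustration; the abstract does not specify how the reported F1 scores are computed.

```python
from typing import List, Set, Tuple

# Hypothetical entity representation: (slot label, value).
Entity = Tuple[str, str]


def set_f1(references: List[Set[Entity]], hypotheses: List[Set[Entity]]) -> float:
    """Micro-averaged F1 over utterances, ignoring the order of entities."""
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have the same length")

    tp = fp = fn = 0
    for ref, hyp in zip(references, hypotheses):
        tp += len(ref & hyp)   # entities predicted and present in the reference
        fp += len(hyp - ref)   # entities predicted but not in the reference
        fn += len(ref - hyp)   # reference entities that were missed

    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# The same entities in a different order give a perfect score,
# because both sides are compared as sets.
reference = [{("fromloc", "boston"), ("toloc", "denver")}]
reordered = [set([("toloc", "denver"), ("fromloc", "boston")])]
perfect = set_f1(reference, reordered)

# One correct entity and one wrong entity: precision = recall = 0.5.
partly_wrong = [{("fromloc", "boston"), ("toloc", "dallas")}]
partial = set_f1(reference, partly_wrong)
```

Because the metric is order-independent, the order of entities affects only how the models are trained, which is the aspect the abstract addresses, not how their output is scored in this sketch.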

research
09/30/2020

End-to-End Spoken Language Understanding Without Full Transcripts

An essential component of spoken language understanding (SLU) is slot fi...
research
09/29/2019

Recent Advances in End-to-End Spoken Language Understanding

This work investigates spoken language understanding (SLU) systems in th...
research
05/07/2016

Adobe-MIT submission to the DSTC 4 Spoken Language Understanding pilot task

The Dialog State Tracking Challenge 4 (DSTC 4) proposes several pilot ta...
research
08/08/2020

Deep F-measure Maximization for End-to-End Speech Understanding

Spoken language understanding (SLU) datasets, like many other machine le...
research
07/01/2022

Toward Low-Cost End-to-End Spoken Language Understanding

Recent advances in spoken language understanding benefited from Self-Sup...
research
04/08/2021

RNN Transducer Models For Spoken Language Understanding

We present a comprehensive study on building and adapting RNN transducer...
research
08/13/2020

Large-scale Transfer Learning for Low-resource Spoken Language Understanding

End-to-end Spoken Language Understanding (SLU) models are made increasin...
