RNN Transducer Models For Spoken Language Understanding

04/08/2021
by   Samuel Thomas, et al.
0

We present a comprehensive study on building and adapting RNN transducer (RNN-T) models for spoken language understanding(SLU). These end-to-end (E2E) models are constructed in three practical settings: a case where verbatim transcripts are available, a constrained case where the only available annotations are SLU labels and their values, and a more restrictive case where transcripts are available but not corresponding audio. We show how RNN-T SLU models can be developed starting from pre-trained automatic speech recognition (ASR) systems, followed by an SLU adaptation step. In settings where real audio data is not available, artificially synthesized speech is used to successfully adapt various SLU models. When evaluated on two SLU data sets, the ATIS corpus and a customer call center data set, the proposed models closely track the performance of other E2E models and achieve state-of-the-art results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/15/2020

Exploring Transfer Learning For End-to-End Spoken Language Understanding

Voice Assistants such as Alexa, Siri, and Google Assistant typically use...
research
02/23/2018

Towards end-to-end spoken language understanding

Spoken language understanding system is traditionally designed as a pipe...
research
05/02/2023

A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge

Recently there have been efforts to introduce new benchmark tasks for sp...
research
11/19/2021

SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech

Progress in speech processing has been facilitated by shared datasets an...
research
01/28/2022

Improving End-to-End Models for Set Prediction in Spoken Language Understanding

The goal of spoken language understanding (SLU) systems is to determine ...
research
10/07/2019

Adapting a FrameNet Semantic Parser for Spoken Language Understanding Using Adversarial Learning

This paper presents a new semantic frame parsing model, based on Berkele...
research
03/16/2023

Trustera: A Live Conversation Redaction System

Trustera, the first functional system that redacts personally identifiab...

Please sign up or login with your details

Forgot password? Click here to reset