End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator

10/29/2022
by Guangzhi Sun, et al.

End-to-end spoken language understanding (SLU) suffers from the long-tail word problem. This paper exploits contextual biasing, a technique for improving the speech recognition of rare words, in end-to-end SLU systems. Specifically, it studies the tree-constrained pointer generator (TCPGen), a powerful and efficient biasing model component, which leverages a slot shortlist with corresponding entities to extract biasing lists. In addition, to bias the output slot distribution of the SLU model, a slot probability biasing (SPB) mechanism is proposed that calculates a slot distribution from TCPGen. Experiments on the SLURP dataset showed consistent SLU-F1 improvements using TCPGen and SPB, especially on unseen entities. On a new split that holds out 5 slot types for testing, TCPGen with SPB achieved zero-shot learning with an SLU-F1 score over 50, whereas the baselines cannot handle the held-out slot types. In addition to slot filling, intent classification accuracy was also improved.
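
The abstract says that biasing lists are extracted from a slot shortlist with corresponding entities, and the name TCPGen suggests that the list constrains the model through a tree. The sketch below illustrates that idea only, under stated assumptions: the slot names, the entities, and the helpers `extract_biasing_list`, `build_prefix_tree`, and `valid_next_symbols` are all hypothetical, and characters stand in for whatever subword units the actual system uses. It does not reproduce the pointer generator or the SPB computation.

```python
from typing import Dict, List, Set

# Hypothetical slot-to-entity mapping; names and entries are illustrative only.
SLOT_ENTITIES: Dict[str, List[str]] = {
    "person": ["guangzhi", "gwen"],
    "place_name": ["cambridge", "camden"],
    "music_genre": ["jazz"],
}


def extract_biasing_list(slot_shortlist: List[str]) -> List[str]:
    """Collect the entities that belong to the shortlisted slot types."""
    words: List[str] = []
    for slot in slot_shortlist:
        words.extend(SLOT_ENTITIES.get(slot, []))
    return words


def build_prefix_tree(words: List[str]) -> dict:
    """Build a nested-dict prefix tree; '$' marks the end of a word."""
    root: dict = {}
    for word in words:
        node = root
        for ch in word:
            node = node.setdefault(ch, {})
        node["$"] = {}
    return root


def valid_next_symbols(tree: dict, prefix: str) -> Set[str]:
    """Return the symbols that can follow `prefix` on some path in the tree."""
    node = tree
    for ch in prefix:
        if ch not in node:
            return set()  # prefix is not on any biasing-word path
        node = node[ch]
    return set(node.keys())


# Only entities of shortlisted slots end up in the biasing list.
biasing_list = extract_biasing_list(["person", "place_name"])
tree = build_prefix_tree(biasing_list)
```

With this toy data, `valid_next_symbols(tree, "cam")` yields the two continuations that lead to "cambridge" and "camden", while a prefix from a slot outside the shortlist (such as "jazz") yields an empty set. A biasing component could use such a set to restrict which symbols receive extra probability mass at each decoding step.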

research
09/30/2020

End-to-End Spoken Language Understanding Without Full Transcripts

An essential component of spoken language understanding (SLU) is slot fi...
research
09/06/2016

Joint Online Spoken Language Understanding and Language Modeling with Recurrent Neural Networks

Speaker intent detection and semantic slot filling are two critical task...
research
06/24/2016

Sequential Convolutional Neural Networks for Slot Filling in Spoken Language Understanding

We investigate the usage of convolutional neural networks (CNNs) for the...
research
08/19/2018

Source-Critical Reinforcement Learning for Transferring Spoken Language Understanding to a New Language

To deploy a spoken language understanding (SLU) model to a new language,...
research
04/03/2021

Intent Recognition and Unsupervised Slot Identification for Low Resourced Spoken Dialog Systems

Intent Recognition and Slot Identification are crucial components in spo...
research
04/09/2019

A Hierarchical Decoding Model For Spoken Language Understanding From Unaligned Data

Spoken language understanding (SLU) systems can be trained on two types ...
research
11/06/2018

CIS at TAC Cold Start 2015: Neural Networks and Coreference Resolution for Slot Filling

This paper describes the CIS slot filling system for the TAC Cold Start ...
