Improving End-to-End Speech-to-Intent Classification with Reptile

08/05/2020
by Yusheng Tian, et al.

End-to-end spoken language understanding (SLU) systems have many advantages over conventional pipeline systems, but collecting in-domain speech data to train an end-to-end system is costly and time-consuming. This raises a question: how can an end-to-end SLU system be trained with limited amounts of data? Many researchers have explored approaches that make use of other related data resources, typically by pre-training parts of the model on high-resource speech recognition. In this paper, we suggest improving the generalization performance of SLU models with a non-standard learning algorithm, Reptile. Though Reptile was originally proposed for model-agnostic meta-learning, we argue that it can also be used to learn a target task directly and that it generalizes better than conventional gradient descent. In this work, we apply Reptile to the task of end-to-end spoken intent classification. Experiments on four datasets covering different languages and domains show improved intent prediction accuracy, both when Reptile is used alone and when it is used in addition to pre-training.
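
To make the idea of using Reptile on a single target task more concrete, here is a minimal sketch in Python with NumPy. It is not the authors' SLU model or training setup: it assumes a toy linear softmax classifier on synthetic data, and the function names (`make_toy_data`, `reptile_single_task`) and all hyperparameter values are hypothetical choices for illustration. The only part taken from Reptile itself is the update rule: run a few SGD steps from the current weights `theta` to obtain `phi`, then move `theta` a fraction of the way toward `phi`. In the single-task setting, every inner loop draws its mini-batches from the same training set.

```python
import numpy as np

def make_toy_data(n=150, dim=8, n_classes=3, seed=0):
    """Synthetic stand-in for intent data: one Gaussian cluster per class (assumption)."""
    rng = np.random.default_rng(seed)
    centers = 2.0 * rng.normal(size=(n_classes, dim))
    y = rng.integers(0, n_classes, size=n)
    X = centers[y] + rng.normal(size=(n, dim))
    return X, y

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def grad(W, X, y):
    """Gradient of the mean cross-entropy loss for a linear softmax classifier."""
    p = softmax(X @ W)
    p[np.arange(len(y)), y] -= 1.0
    return X.T @ p / len(y)

def reptile_single_task(X, y, n_classes, outer_steps=200, inner_steps=5,
                        inner_lr=0.05, outer_lr=0.5, batch_size=16, seed=0):
    """Reptile applied to one task; hyperparameter values are illustrative only."""
    rng = np.random.default_rng(seed)
    theta = np.zeros((X.shape[1], n_classes))
    for _ in range(outer_steps):
        phi = theta.copy()
        # Inner loop: a few plain SGD steps on mini-batches from the same task.
        for _ in range(inner_steps):
            idx = rng.choice(len(y), size=batch_size, replace=False)
            phi -= inner_lr * grad(phi, X[idx], y[idx])
        # Outer step: interpolate from the old weights toward the adapted weights.
        theta += outer_lr * (phi - theta)
    return theta

if __name__ == "__main__":
    X, y = make_toy_data()
    theta = reptile_single_task(X, y, n_classes=3)
    accuracy = (np.argmax(X @ theta, axis=1) == y).mean()
    print(f"training accuracy: {accuracy:.2f}")
```

With `inner_steps=1` and `outer_lr=1.0` the procedure reduces to ordinary mini-batch SGD, which makes the sketch a convenient baseline for comparing the two update schemes on the same data.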


