Intent Classification Using Pre-Trained Embeddings For Low Resource Languages

10/18/2021
by   Hemant Yadav, et al.

Building Spoken Language Understanding (SLU) systems that do not rely on language-specific Automatic Speech Recognition (ASR) is an important yet under-explored problem in language processing. In this paper, we present a comparative study aimed at employing a pre-trained acoustic model to perform SLU in low-resource scenarios. Specifically, we use three different embeddings extracted using Allosaurus, a pre-trained universal phone decoder: (1) Phone, (2) Panphone, and (3) Allo embeddings. These embeddings are then used to identify the spoken intent. We perform experiments across three languages, English, Sinhala, and Tamil, each with a different data size to simulate high-, medium-, and low-resource scenarios. Our system improves on the state-of-the-art (SOTA) intent classification accuracy by approximately 2.11% for Sinhala and 7.00% for Tamil. Furthermore, we present a quantitative analysis of how performance scales with the number of training examples used per intent.

research
05/25/2022

On Building Spoken Language Understanding Systems for Low Resourced Languages

Spoken dialog systems are slowly becoming an integral part of the human...
research
11/24/2022

Multitask Learning for Low Resource Spoken Language Understanding

We explore the benefits that multitask learning offers to speech processi...
research
05/15/2021

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding

The lack of publicly available evaluation data for low-resource language...
research
11/07/2020

Acoustics Based Intent Recognition Using Discovered Phonetic Units for Low Resource Languages

With recent advancements in language technologies, humans are now interac...
research
01/12/2021

A character representation enhanced on-device Intent Classification

Intent classification is an important task in natural language understan...
research
07/12/2021

End-to-End Natural Language Understanding Pipeline for Bangla Conversational Agents

Chatbots are intelligent software built to be used as a replacement for ...
research
03/30/2021

Pre-training for low resource speech-to-intent applications

Designing a speech-to-intent (S2I) agent which maps the users' spoken co...
