Robust Open-Set Spoken Language Identification and the CU MultiLang Dataset

08/29/2023
by Mustafa Eyceoz, et al.

Most state-of-the-art spoken language identification models are closed-set; in other words, they can only output a language label from the set of classes they were trained on. Open-set spoken language identification systems, by contrast, can detect when an input belongs to none of the trained languages. In this paper, we implement a novel approach to open-set spoken language identification that uses MFCC and pitch features, a TDNN model to extract meaningful feature embeddings, confidence thresholding on softmax outputs, and LDA and pLDA for learning to classify new unknown languages. We present a spoken language identification system that achieves 91.76% accuracy on trained languages and can adapt to unknown languages on the fly. To that end, we also built the CU MultiLang Dataset, a large and diverse multilingual speech corpus, which was used to train and evaluate our system.
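
The confidence-thresholding step mentioned above can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the language labels, and the threshold value of 0.9 are all hypothetical, and the logits stand in for whatever scores a trained classifier would produce. The idea is only that an input is labeled "unknown" when the largest softmax probability falls below a chosen threshold.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable form)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def open_set_predict(logits, labels, threshold=0.9):
    """Return (label, confidence); label is "unknown" below the threshold.

    `threshold` is an illustrative assumption, not a value from the paper.
    """
    probs = softmax(logits)
    best = max(range(len(probs)), key=lambda i: probs[i])
    confidence = probs[best]
    if confidence < threshold:
        return "unknown", confidence
    return labels[best], confidence


# Hypothetical label set and scores, for illustration only.
labels = ["en", "es", "fr"]
confident_label, confident_score = open_set_predict([5.0, 0.0, 0.0], labels)
uncertain_label, uncertain_score = open_set_predict([1.0, 1.0, 1.0], labels)
```

In this sketch a peaked score vector is assigned its top language, while a flat one is rejected as unknown. In practice the threshold would be tuned on held-out data, and the paper's system additionally uses LDA and pLDA to learn new languages, which this sketch does not cover.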


research
05/19/2022

Automatic Spoken Language Identification using a Time-Delay Neural Network

Closed-set spoken language identification is the task of recognizing the...
research
10/22/2020

Rediscovering the Slavic Continuum in Representations Emerging from Neural Models of Spoken Language Identification

Deep neural networks have been employed for various spoken language reco...
research
07/16/2017

Open-Set Language Identification

We present the first open-set language identification experiments using ...
research
05/20/2022

Modernizing Open-Set Speech Language Identification

While most modern speech Language Identification methods are closed-set,...
research
11/29/2018

Tuplemax Loss for Language Identification

In many scenarios of a language identification task, the user will speci...
research
01/29/2020

Improving Language Identification for Multilingual Speakers

Spoken language identification (LID) technologies have improved in recen...
research
06/02/2023

Efficient Spoken Language Recognition via Multilabel Classification

Spoken language recognition (SLR) is the task of automatically identifyi...
