A Nonparametric Bayesian Approach for Spoken Term Detection by Example Query

State-of-the-art speech recognition systems use data-intensive context-dependent phonemes as acoustic units. However, these approaches do not translate well to low-resource languages, where large amounts of training data are not available. For such languages, automatic discovery of acoustic units is critical. In this paper, we demonstrate the application of nonparametric Bayesian models to acoustic unit discovery. We show that the discovered units are correlated with phonemes and are therefore linguistically meaningful. We also present a spoken term detection (STD) by example query algorithm based on these automatically learned units. We show that our proposed system produces a P@N of 61.2 EER is 5 literature.
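The abstract reports P@N without defining it. As a minimal sketch, assuming the definition commonly used in spoken term detection (the precision of the N highest-scoring candidate detections, where N is the number of true occurrences of the query term), the metric could be computed as below. The function name `precision_at_n` and the toy scores and labels are hypothetical illustrations, not taken from the paper.

```python
from typing import Sequence


def precision_at_n(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Return P@N for one query.

    Assumed definition (not stated in the abstract): N is the number of
    true occurrences of the term, and P@N is the fraction of the N
    top-scoring candidates that are true occurrences.

    scores: detection score per candidate (higher means more confident).
    labels: 1 if the candidate is a true occurrence, otherwise 0.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    n = sum(labels)
    if n == 0:
        raise ValueError("P@N is undefined when the term never occurs")
    # Rank candidates from most to least confident.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    # Count true occurrences among the top N candidates.
    hits = sum(label for _, label in ranked[:n])
    return hits / n


# Hypothetical toy data: five candidate detections, three true occurrences.
toy_scores = [0.9, 0.8, 0.7, 0.4, 0.2]
toy_labels = [1, 0, 1, 1, 0]
toy_p_at_n = precision_at_n(toy_scores, toy_labels)
```

In this toy case the three top-scoring candidates contain two true occurrences, so the value is 2/3. A system-level figure would typically average this quantity over all queries, which is also an assumption here.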

research
02/16/2018

Bayesian Models for Unit Discovery on a Very Low Resource Language

Developing speech technologies for low-resource languages has become a v...
research
08/19/2019

Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews

In automatic speech recognition, often little training data is available...
research
12/19/2017

Subword and Crossword Units for CTC Acoustic Models

This paper proposes a novel approach to create an unit set for CTC based...
research
04/08/2019

Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery

This work tackles the problem of learning a set of language specific aco...
research
09/07/2015

Unsupervised Spoken Term Detection with Spoken Queries by Multi-level Acoustic Patterns with Varying Model Granularity

This paper presents a new approach for unsupervised Spoken Term Detectio...
research
11/28/2017

Unsupervised Discovery of Structured Acoustic Tokens with Applications to Spoken Term Detection

In this paper, we compare two paradigms for unsupervised discovery of st...
