Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery

04/08/2019
by   Lucas Ondel, et al.
0

This work tackles the problem of learning a set of language specific acoustic units from unlabeled speech recordings given a set of labeled recordings from other languages. Our approach may be described by the following two steps procedure: first the model learns the notion of acoustic units from the labelled data and then the model uses its knowledge to find new acoustic units on the target language. We implement this process with the Bayesian Subspace Hidden Markov Model (SHMM), a model akin to the Subspace Gaussian Mixture Model (SGMM) where each low dimensional embedding represents an acoustic unit rather than just a HMM's state. The subspace is trained on 3 languages from the GlobalPhone corpus (German, Polish and Spanish) and the AUs are discovered on the TIMIT corpus. Results, measured in equivalent Phone Error Rate, show that this approach significantly outperforms previous HMM based acoustic units discovery systems and compares favorably with the Variational Auto Encoder-HMM.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/04/2020

A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery

In this work, we propose a hierarchical subspace model for acoustic unit...
research
05/19/2020

Bayesian Subspace HMM for the Zerospeech 2020 Challenge

In this paper we describe our submission to the Zerospeech 2020 challeng...
research
02/16/2018

Bayesian Models for Unit Discovery on a Very Low Resource Language

Developing speech technologies for low-resource languages has become a v...
research
06/20/2016

A Nonparametric Bayesian Approach for Spoken Term detection by Example Query

State of the art speech recognition systems use data-intensive context-d...
research
05/04/2021

Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery

Discovering speaker independent acoustic units purely from spoken input ...
research
07/31/2020

An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances

In this paper, we propose a sub-utterance unit selection framework to re...
research
06/22/2020

Articulatory-WaveNet: Autoregressive Model For Acoustic-to-Articulatory Inversion

This paper presents Articulatory-WaveNet, a new approach for acoustic-to...

Please sign up or login with your details

Forgot password? Click here to reset