Bayesian Models for Unit Discovery on a Very Low Resource Language

02/16/2018
by   Lucas Ondel, et al.
0

Developing speech technologies for low-resource languages has become a very active research field over the last decade. Among others, Bayesian models have shown some promising results on artificial examples but still lack of in situ experiments. Our work applies state-of-the-art Bayesian models to unsupervised Acoustic Unit Discovery (AUD) in a real low-resource language scenario. We also show that Bayesian models can naturally integrate information from other resourceful languages by means of informative prior leading to more consistent discovered units. Finally, discovered acoustic units are used, either as the 1-best sequence or as a lattice, to perform word segmentation. Word segmentation results show that this Bayesian approach clearly outperforms a Segmental-DTW baseline on the same corpus.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/04/2020

A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery

In this work, we propose a hierarchical subspace model for acoustic unit...
research
06/08/2021

Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings

When documenting oral-languages, Unsupervised Word Segmentation (UWS) fr...
research
06/18/2018

Unsupervised Word Segmentation from Speech with Attention

We present a first attempt to perform attentional word segmentation dire...
research
06/20/2016

A Nonparametric Bayesian Approach for Spoken Term detection by Example Query

State of the art speech recognition systems use data-intensive context-d...
research
04/08/2019

Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery

This work tackles the problem of learning a set of language specific aco...
research
09/10/2020

Exploration of End-to-end Synthesisers forZero Resource Speech Challenge 2020

A Spoken dialogue system for an unseen language is referred to as Zero r...
research
06/29/2019

Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-resource Settings

Since Bahdanau et al. [1] first introduced attention for neural machine ...

Please sign up or login with your details

Forgot password? Click here to reset