Modelling Data Dispersion Degree in Automatic Robust Estimation for Multivariate Gaussian Mixture Models with an Application to Noisy Speech Processing

05/19/2014
by   Dalei Wu, et al.
0

The trimming scheme with a prefixed cutoff portion is known as a method of improving the robustness of statistical models such as multivariate Gaussian mixture models (MG- MMs) in small scale tests by alleviating the impacts of outliers. However, when this method is applied to real- world data, such as noisy speech processing, it is hard to know the optimal cut-off portion to remove the outliers and sometimes removes useful data samples as well. In this paper, we propose a new method based on measuring the dispersion degree (DD) of the training data to avoid this problem, so as to realise automatic robust estimation for MGMMs. The DD model is studied by using two different measures. For each one, we theoretically prove that the DD of the data samples in a context of MGMMs approximately obeys a specific (chi or chi-square) distribution. The proposed method is evaluated on a real-world application with a moderately-sized speaker recognition task. Experiments show that the proposed method can significantly improve the robustness of the conventional training method of GMMs for speaker recognition.

READ FULL TEXT
research
11/28/2021

Schema matching using Gaussian mixture models with Wasserstein distance

Gaussian mixture models find their place as a powerful tool, mostly in t...
research
12/19/2020

Robust mixture regression with Exponential Power distribution

Assuming an exponential power distribution is one way to deal with outli...
research
07/02/2019

Using Subset Log-Likelihoods to Trim Outliers in Gaussian Mixture Models

Mixtures of Gaussian distributions are a popular choice in model-based c...
research
08/02/2018

Histogram Transform-based Speaker Identification

A novel text-independent speaker identification (SI) method is proposed....
research
04/08/2020

Robust Mixture Modeling using Weighted Complete Estimating Equations

Mixture modeling that takes account of potential heterogeneity in data i...
research
12/16/2015

A Novel Minimum Divergence Approach to Robust Speaker Identification

In this work, a novel solution to the speaker identification problem is ...
research
10/31/2017

Nebula: F0 Estimation and Voicing Detection by Modeling the Statistical Properties of Feature Extractors

A F0 and voicing status estimation algorithm for speech analysis/synthes...

Please sign up or login with your details

Forgot password? Click here to reset