Towards Speaker Age Estimation with Label Distribution Learning

02/23/2022
by   Shijing Si, et al.
0

Existing methods for speaker age estimation usually treat it as a multi-class classification or a regression problem. However, precise age identification remains a challenge due to label ambiguity, i.e., utterances from adjacent age of the same person are often indistinguishable. To address this, we utilize the ambiguous information among the age labels, convert each age label into a discrete label distribution and leverage the label distribution learning (LDL) method to fit the data. For each audio data sample, our method produces a age distribution of its speaker, and on top of the distribution we also perform two other tasks: age prediction and age uncertainty minimization. Therefore, our method naturally combines the age classification and regression approaches, which enhances the robustness of our method. We conduct experiments on the public NIST SRE08-10 dataset and a real-world dataset, which exhibit that our method outperforms baseline methods by a relatively large margin, yielding a 10% reduction in terms of mean absolute error (MAE) on a real-world dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2022

SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning

Estimating age from a single speech is a classic and challenging topic. ...
research
09/20/2018

A Coupled Evolutionary Network for Age Estimation

Age estimation of unknown persons is a challenging pattern analysis task...
research
07/13/2022

Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings

Automatic speaker verification has achieved remarkable progress in recen...
research
09/05/2022

Full Kullback-Leibler-Divergence Loss for Hyperparameter-free Label Distribution Learning

The concept of Label Distribution Learning (LDL) is a technique to stabi...
research
11/06/2016

Deep Label Distribution Learning with Label Ambiguity

Convolutional Neural Networks (ConvNets) have achieved excellent recogni...
research
06/17/2021

using multiple losses for accurate facial age estimation

Age estimation is an essential challenge in computer vision. With the ad...
research
07/15/2023

Learning Subjective Time-Series Data via Utopia Label Distribution Approximation

Subjective time-series regression (STR) tasks have gained increasing att...

Please sign up or login with your details

Forgot password? Click here to reset