Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

06/18/2019
by   Xu Xiang, et al.
0

Recently, speaker embeddings extracted from a speaker discriminative deep neural network (DNN) yield better performance than the conventional methods such as i-vector. In most cases, the DNN speaker classifier is trained using cross entropy loss with softmax. However, this kind of loss function does not explicitly encourage inter-class separability and intra-class compactness. As a result, the embeddings are not optimal for speaker recognition tasks. In this paper, to address this issue, three different margin based losses which not only separate classes but also demand a fixed margin between classes are introduced to deep speaker embedding learning. It could be demonstrated that the margin is the key to obtain more discriminative speaker embeddings. Experiments are conducted on two public text independent tasks: VoxCeleb1 and Speaker in The Wild (SITW). The proposed approach can achieve the state-of-the-art performance, with 25 on both tasks when compared to strong baselines using cross entropy loss with softmax, obtaining 2.238 core-core test set, respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2021

Adaptive Margin Circle Loss for Speaker Verification

Deep-Neural-Network (DNN) based speaker verification sys-tems use the an...
research
04/25/2022

Back-ends Selection for Deep Speaker Embeddings

Probabilistic Linear Discriminant Analysis (PLDA) was the dominant and n...
research
12/05/2017

OLÉ: Orthogonal Low-rank Embedding, A Plug and Play Geometric Loss for Deep Learning

Deep neural networks trained using a softmax layer at the top and the cr...
research
04/08/2022

Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA?

The emergence of large-margin softmax cross-entropy losses in training d...
research
08/12/2019

A Study on Angular Based Embedding Learning for Text-independent Speaker Verification

Learning a good speaker embedding is important for many automatic speake...
research
09/12/2021

A Decidability-Based Loss Function

Nowadays, deep learning is the standard approach for a wide range of pro...
research
10/18/2021

Real Additive Margin Softmax for Speaker Verification

The additive margin softmax (AM-Softmax) loss has delivered remarkable p...

Please sign up or login with your details

Forgot password? Click here to reset