DeepAI AI Chat
Log In Sign Up

End-to-End Residual CNN with L-GM Loss Speaker Verification System

05/02/2018
by   Xuan Shi, et al.
Shanghai University
0

We propose an end-to-end speaker verification system based on the neural network and trained by a loss function with less computational complexity. The end-to-end speaker verification system consists of a ResNet architecture to extract features from utterance, then mean pool to produces utterance- level speaker embeddings, and train using the large-margin Gaussian Mixture loss function. Influenced by the large-margin and likelihood regularization, large-margin Gaussian Mixture loss function benefits the speaker verification performance. Experimental results demonstrate that the Residual CNN with large- margin Gaussian Mixture loss outperforms DNN-based i-vector baseline by nearly 10

READ FULL TEXT

page 1

page 2

page 3

page 4

04/06/2019

Large Margin Softmax Loss for Speaker Verification

In neural network based speaker verification, speaker embedding is expec...
10/28/2017

Generalized End-to-End Loss for Speaker Verification

In this paper, we propose a new loss function called generalized end-to-...
04/05/2021

Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition

Many neural network speaker recognition systems model each speaker using...
11/07/2018

Generative Adversarial Speaker Embedding Networks for Domain Robust End-to-End Speaker Verification

This article presents a novel approach for learning domain-invariant spe...
03/08/2018

Rethinking Feature Distribution for Loss Functions in Image Classification

We propose a large-margin Gaussian Mixture (L-GM) loss for deep neural n...
05/03/2018

Supervector Compression Strategies to Speed up I-Vector System Development

The front-end factor analysis (FEFA), an extension of principal componen...