A Unified Deep Neural Network for Speaker and Language Recognition

04/03/2015
by   Fred Richardson, et al.
0

Learned feature representations and sub-phoneme posteriors from Deep Neural Networks (DNNs) have been used separately to produce significant performance gains for speaker and language recognition tasks. In this work we show how these gains are possible using a single DNN for both speaker and language recognition. The unified DNN approach is shown to yield substantial performance improvements on the the 2013 Domain Adaptation Challenge speaker recognition task (55 2011 Language Recognition Evaluation (48 condition).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/28/2015

Improved Deep Speaker Feature Learning for Text-Dependent Speaker Recognition

A deep learning approach has been proposed recently to derive speaker id...
research
08/06/2020

Aalto's End-to-End DNN systems for the INTERSPEECH 2020 Computational Paralinguistics Challenge

End-to-end neural network models (E2E) have shown significant performanc...
research
04/16/2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences

The I4U consortium was established to facilitate a joint entry to NIST s...
research
08/03/2017

Recursive Whitening Transformation for Speaker Recognition on Language Mismatched Condition

Recently in speaker recognition, performance degradation due to the chan...
research
10/25/2019

Structural sparsification for Far-field Speaker Recognition with GNA

Recently, deep neural networks (DNN) have been widely used in speaker re...
research
01/16/2013

Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks

Recent studies have shown that deep neural networks (DNNs) perform signi...
research
07/13/2019

BUT VOiCES 2019 System Description

This is a description of our effort in VOiCES 2019 Speaker Recognition c...

Please sign up or login with your details

Forgot password? Click here to reset