Siamese x-vector reconstruction for domain adapted speaker recognition

07/28/2020
by   Shai Rozenberg, et al.
0

With the rise of voice-activated applications, the need for speaker recognition is rapidly increasing. The x-vector, an embedding approach based on a deep neural network (DNN), is considered the state-of-the-art when proper end-to-end training is not feasible. However, the accuracy significantly decreases when recording conditions (noise, sample rate, etc.) are mismatched, either between the x-vector training data and the target data or between enrollment and test data. We introduce the Siamese x-vector Reconstruction (SVR) for domain adaptation. We reconstruct the embedding of a higher quality signal from a lower quality counterpart using a lean auxiliary Siamese DNN. We evaluate our method on several mismatch scenarios and demonstrate significant improvement over the baseline.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/28/2019

Deep Neural Network Embedding Learning with High-Order Statistics for Text-Independent Speaker Verification

The x-vector based deep neural network (DNN) embedding systems have demo...
research
12/26/2018

The CORAL+ Algorithm for Unsupervised Domain Adaptation of PLDA

State-of-the-art speaker recognition systems comprise an x-vector (or i-...
research
06/28/2015

Improved Deep Speaker Feature Learning for Text-Dependent Speaker Recognition

A deep learning approach has been proposed recently to derive speaker id...
research
12/25/2017

Leveraging Native Language Speech for Accent Identification using Deep Siamese Networks

The problem of automatic accent identification is important for several ...
research
04/08/2020

Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification

Speaker verification systems usually suffer from the mismatch problem be...
research
09/28/2020

Siamese Capsule Network for End-to-End Speaker Recognition In The Wild

We propose an end-to-end deep model for speaker verification in the wild...
research
08/11/2020

Compact Speaker Embedding: lrx-vector

Deep neural networks (DNN) have recently been widely used in speaker rec...

Please sign up or login with your details

Forgot password? Click here to reset