Improving Embedding Extraction for Speaker Verification with Ladder Network

03/20/2020
by   Fei Tao, et al.
0

Speaker verification is an established yet challenging task in speech processing and a very vibrant research area. Recent speaker verification (SV) systems rely on deep neural networks to extract high-level embeddings which are able to characterize the users' voices. Most of the studies have investigated on improving the discriminability of the networks to extract better embeddings for performances improvement. However, only few research focus on improving the generalization. In this paper, we propose to apply the ladder network framework in the SV systems, which combines the supervised and unsupervised learning fashions. The ladder network can make the system to have better high-level embedding by balancing the trade-off to keep/discard as much useful/useless information as possible. We evaluated the framework on two state-of-the-art SV systems, d-vector and x-vector, which can be used for different use cases. The experiments showed that the proposed approach relatively improved the performance by 10

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/24/2020

Raw-x-vector: Multi-scale Time Domain Speaker Embedding Network

State-of-the-art text-independent speaker verification systems typically...
research
03/28/2019

Deep Neural Network Embedding Learning with High-Order Statistics for Text-Independent Speaker Verification

The x-vector based deep neural network (DNN) embedding systems have demo...
research
04/13/2018

Speaker Embedding Extraction with Phonetic Information

Speaker embeddings achieve promising results on many speaker verificatio...
research
04/14/2021

Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System

Most speaker verification tasks are studied as an open-set evaluation sc...
research
01/16/2023

Improving Target Speaker Extraction with Sparse LDA-transformed Speaker Embeddings

As a practical alternative of speech separation, target speaker extracti...
research
02/28/2022

Magnitude-aware Probabilistic Speaker Embeddings

Recently, hyperspherical embeddings have established themselves as a dom...
research
08/07/2020

Disentangled speaker and nuisance attribute embedding for robust speaker verification

Over the recent years, various deep learning-based embedding methods hav...

Please sign up or login with your details

Forgot password? Click here to reset