Deep learning methods in speaker recognition: a review

11/14/2019
by   Dávid Sztahó, et al.
0

This paper summarizes the applied deep learning practices in the field of speaker recognition, both verification and identification. Speaker recognition has been a widely used field topic of speech technology. Many research works have been carried out and little progress has been achieved in the past 5-6 years. However, as deep learning techniques do advance in most machine learning fields, the former state-of-the-art methods are getting replaced by them in speaker recognition too. It seems that DL becomes the now state-of-the-art solution for both speaker verification and identification. The standard x-vectors, additional to i-vectors, are used as baseline in most of the novel works. The increasing amount of gathered data opens up the territory to DL, where they are the most effective.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/10/2020

Generalized LSTM-based End-to-End Text-Independent Speaker Verification

The increasing amount of available data and more affordable hardware sol...
research
01/24/2021

A Review of Speaker Diarization: Recent Advances with Deep Learning

Speaker diarization is a task to label audio or video recordings with cl...
research
10/09/2021

Towards Lightweight Applications: Asymmetric Enroll-Verify Structure for Speaker Verification

With the development of deep learning, automatic speaker verification ha...
research
03/22/2022

Speaker recognition with a MLP classifier and LPCC codebook

This paper improves the speaker recognition rates of a MLP classifier an...
research
10/12/2020

A Lightweight Speaker Recognition System Using Timbre Properties

Speaker recognition is an active research area that contains notable usa...
research
09/12/2023

SynVox2: Towards a privacy-friendly VoxCeleb2 dataset

The success of deep learning in speaker recognition relies heavily on th...
research
03/31/2022

Improving speaker de-identification with functional data analysis of f0 trajectories

Due to a constantly increasing amount of speech data that is stored in d...

Please sign up or login with your details

Forgot password? Click here to reset