Deep CNN based feature extractor for text-prompted speaker recognition

03/13/2018
by   Sergey Novoselov, et al.

Deep learning is still not a very common tool in the speaker verification field. We study the performance of deep convolutional neural networks in the text-prompted speaker verification task. The prompted passphrase is segmented into word states (i.e., digits) so that each digit utterance is tested separately. We train a single high-level feature extractor for all states and use the cosine similarity metric for scoring. The key feature of our network is the Max-Feature-Map activation function, which acts as an embedded feature selector. By using a multitask learning scheme to train the high-level feature extractor, we were able to surpass the classic baseline systems in terms of quality and achieved impressive results for such a novel approach, getting 2.85 on the evaluation set. Fusion of the proposed and the baseline systems improves this result.
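
The two mechanisms named in the abstract, the Max-Feature-Map (MFM) activation and per-digit cosine scoring, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the list-based data layout, and in particular the averaging of per-digit scores into one passphrase score are assumptions made for illustration only. MFM is shown in its usual form, where the channels are split into two halves and the element-wise maximum is kept, so the layer outputs half as many channels as it receives.

```python
import math


def max_feature_map(channels):
    """MFM activation: split the channels into two halves and keep the
    element-wise maximum, halving the channel count."""
    if len(channels) % 2 != 0:
        raise ValueError("MFM needs an even number of channels")
    half = len(channels) // 2
    return [
        [max(a, b) for a, b in zip(first, second)]
        for first, second in zip(channels[:half], channels[half:])
    ]


def cosine_similarity(u, v):
    """Cosine of the angle between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def passphrase_score(enroll, test):
    """Score each digit separately, then average the scores.

    `enroll` and `test` map a digit label to its embedding (hypothetical
    layout). Averaging is an assumption; the abstract only states that
    each digit utterance is tested separately.
    """
    scores = [cosine_similarity(enroll[d], test[d]) for d in test if d in enroll]
    return sum(scores) / len(scores)
```

For example, `max_feature_map([[1.0, -2.0], [0.5, 3.0]])` reduces two channels to one, and `passphrase_score` compares enrollment and test embeddings digit by digit before combining them.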


