Spoof detection using x-vector and feature switching

04/16/2019
by   Mari Ganesh Kumar, et al.
0

Detecting spoofed utterances is a fundamental problem in voice-based biometrics. Spoofing can be performed either by logical accesses like speech synthesis, voice conversion or by physical accesses such as replaying the pre-recorded utterance. Inspired by the state-of-the-art x-vector based speaker verification approach, this paper proposes a deep neural network (DNN) architecture for spoof detection from both logical and physical access. A novelty of the x-vector approach vis-a-vis conventional DNN based systems is that it can handle variable length utterances during testing. Performance of the proposed x-vector systems and the baseline Gaussian mixture model (GMM) systems is analyzed on the ASV-spoof-2019 dataset. The proposed system surpasses the GMM system for physical access, whereas the GMM system detects logical access better. Compared to the GMM systems, the proposed x-vector approach gives an average relative improvement of 14.64 When combined with the decision-level feature switching (DLFS) paradigm, the best system in the proposed approach outperforms the best baseline systems with a relative improvement of 67.48 access in terms of minimum tandem cost detection function (min-t-DCF), respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/16/2018

Deep neural network based i-vector mapping for speaker verification using short utterances

Text-independent speaker recognition using short utterances is a highly ...
research
11/05/2019

The ASVspoof 2019 database

Automatic speaker verification (ASV) is one of the most natural and conv...
research
06/30/2019

Deep Residual Neural Networks for Audio Spoofing Detection

The state-of-art models for speech synthesis and voice conversion are ca...
research
09/01/2021

Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection

Speaker verification systems have been used in many production scenarios...
research
12/31/2019

Statistical Models in Forensic Voice Comparison

This chapter describes a number of signal-processing and statistical-mod...
research
04/11/2022

The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance

Automatic speaker verification is susceptible to various manipulations a...
research
07/13/2019

BUT VOiCES 2019 System Description

This is a description of our effort in VOiCES 2019 Speaker Recognition c...

Please sign up or login with your details

Forgot password? Click here to reset