Audio Spoofing Verification using Deep Convolutional Neural Networks by Transfer Learning

08/08/2020
by   Rahul T P, et al.
0

Automatic Speaker Verification systems are gaining popularity these days; spoofing attacks are of prime concern as they make these systems vulnerable. Some spoofing attacks like Replay attacks are easier to implement but are very hard to detect thus creating the need for suitable countermeasures. In this paper, we propose a speech classifier based on deep-convolutional neural network to detect spoofing attacks. Our proposed methodology uses acoustic time-frequency representation of power spectral densities on Mel frequency scale (Mel-spectrogram), via deep residual learning (an adaptation of ResNet-34 architecture). Using a single model system, we have achieved an equal error rate (EER) of 0.9056 logical access scenario and an equal error rate (EER) of 5.87 development and 5.74 ASVspoof 2019.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset