UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021

07/26/2021
by   Xinhui Chen, et al.
0

In this paper, we present UR-AIR system submission to the logical access (LA) and the speech deepfake (DF) tracks of the ASVspoof 2021 Challenge. The LA and DF tasks focus on synthetic speech detection (SSD), i.e. detecting text-to-speech and voice conversion as spoofing attacks. Different from previous ASVspoof challenges, the LA task this year presents codec and transmission channel variability, while the new task DF presents general audio compression. Built upon our previous research work on improving the robustness of the SSD systems to channel effects, we propose a channel-robust synthetic speech detection system for the challenge. To mitigate the channel variability issue, we use an acoustic simulator to apply transmission codec, compression codec, and convolutional impulse responses to augmenting the original datasets. For the neural network backbone, we propose to use Emphasized Channel Attention, Propagation and Aggregation Time Delay Neural Networks (ECAPA-TDNN) as our primary model. We also incorporate one-class learning with channel-robust training strategies to further learn a channel-invariant speech representation. Our submission achieved EER 20.33 and min-tDCF 0.3094 in the LA task.

READ FULL TEXT
research
09/01/2021

ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection

ASVspoof 2021 is the forth edition in the series of bi-annual challenges...
research
04/03/2021

An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems

Spoofing countermeasure (CM) systems are critical in speaker verificatio...
research
08/12/2021

RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform

In recent years, synthetic speech generated by advanced text-to-speech (...
research
06/06/2023

Phase perturbation improves channel robustness for speech spoofing countermeasures

In this paper, we aim to address the problem of channel robustness in sp...
research
10/05/2022

ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild

Benchmarking initiatives support the meaningful comparison of competing ...
research
02/22/2021

Introducing a Novel Data over Voice Technique for Secure Voice Communication

The current increasing need for privacy-preserving voice communications ...
research
03/18/2020

Detecting Replay Attacks Using Multi-Channel Audio: A Neural Network-Based Method

With the rapidly growing number of security-sensitive systems that use v...

Please sign up or login with your details

Forgot password? Click here to reset