Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks

07/19/2021
by   Xu Li, et al.
0

Existing approaches for anti-spoofing in automatic speaker verification (ASV) still lack generalizability to unseen attacks. The Res2Net approach designs a residual-like connection between feature groups within one block, which increases the possible receptive fields and improves the system's detection generalizability. However, such a residual-like connection is performed by a direct addition between feature groups without channel-wise priority. We argue that the information across channels may not contribute to spoofing cues equally, and the less relevant channels are expected to be suppressed before adding onto the next feature group, so that the system can generalize better to unseen attacks. This argument motivates the current work that presents a novel, channel-wise gated Res2Net (CG-Res2Net), which modifies Res2Net to enable a channel-wise gating mechanism in the connection between feature groups. This gating mechanism dynamically selects channel-wise features based on the input, to suppress the less relevant channels and enhance the detection generalizability. Three gating mechanisms with different structures are proposed and integrated into Res2Net. Experimental results conducted on ASVspoof 2019 logical access (LA) demonstrate that the proposed CG-Res2Net significantly outperforms Res2Net on both the overall LA evaluation set and individual difficult unseen attacks, which also outperforms other state-of-the-art single systems, depicting the effectiveness of our method.

READ FULL TEXT
research
10/28/2020

Replay and Synthetic Speech Detection with Res2net Architecture

Existing approaches for replay and synthetic speech detection still lack...
research
11/04/2022

SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice Anti-Spoofing

Voice anti-spoofing systems are crucial auxiliaries for automatic speake...
research
08/12/2021

RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform

In recent years, synthetic speech generated by advanced text-to-speech (...
research
07/29/2015

STC Anti-spoofing Systems for the ASVspoof 2015 Challenge

This paper presents the Speech Technology Center (STC) systems submitted...
research
09/01/2021

Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection

Speaker verification systems have been used in many production scenarios...
research
07/13/2019

Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge

In this paper, we present the system description of the joint efforts of...
research
04/08/2021

Graph Attention Networks for Anti-Spoofing

The cues needed to detect spoofing attacks against automatic speaker ver...

Please sign up or login with your details

Forgot password? Click here to reset