Deepfake Detection System for the ADD Challenge Track 3.2 Based on Score Fusion

10/13/2022
by   Yuxiang Zhang, et al.
0

This paper describes the deepfake audio detection system submitted to the Audio Deep Synthesis Detection (ADD) Challenge Track 3.2 and gives an analysis of score fusion. The proposed system is a score-level fusion of several light convolutional neural network (LCNN) based models. Various front-ends are used as input features, including low-frequency short-time Fourier transform and Constant Q transform. Due to the complex noise and rich synthesis algorithms, it is difficult to obtain the desired performance using the training set directly. Online data augmentation methods effectively improve the robustness of fake audio detection systems. In particular, the reasons for the poor improvement of score fusion are explored through visualization of the score distributions and comparison with score distribution on another dataset. The overfitting of the model to the training set leads to extreme values of the scores and low correlation of the score distributions, which makes score fusion difficult. Fusion with partially fake audio detection system improves system performance further. The submission on track 3.2 obtained the weighted equal error rate (WEER) of 11.04%, which is one of the best performing systems in the challenge.

READ FULL TEXT

page 7

page 8

research
02/09/2022

CAU_KU team's submission to ADD 2022 Challenge task 1: Low-quality fake audio detection through frequency feature masking

This technical report describes Chung-Ang University and Korea Universit...
research
08/20/2023

The DKU-DUKEECE System for the Manipulation Region Location Task of ADD 2023

This paper introduces our system designed for Track 2, which focuses on ...
research
02/17/2022

ADD 2022: the First Audio Deep Synthesis Detection Challenge

Audio deepfake detection is an emerging topic, which was included in the...
research
03/04/2023

The DKU Post-Challenge Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge: Deep Analysis

This paper further explores our previous wake word spotting system ranke...
research
03/03/2022

The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge

This paper describes our submitted systems to the 2022 ADD challenge wit...
research
08/20/2022

Fully Automated End-to-End Fake Audio Detection

The existing fake audio detection systems often rely on expert experienc...
research
06/27/2023

TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection

Thanks to recent advancements in end-to-end speech modeling technology, ...

Please sign up or login with your details

Forgot password? Click here to reset