Frequency Domain-Based Detection of Generated Audio

05/03/2022
by   Emily R. Bartusiak, et al.
0

Attackers may manipulate audio with the intent of presenting falsified reports, changing an opinion of a public figure, and winning influence and power. The prevalence of inauthentic multimedia continues to rise, so it is imperative to develop a set of tools that determines the legitimacy of media. We present a method that analyzes audio signals to determine whether they contain real human voices or fake human voices (i.e., voices generated by neural acoustic and waveform models). Instead of analyzing the audio signals directly, the proposed approach converts the audio signals into spectrogram images displaying frequency, intensity, and temporal content and evaluates them with a Convolutional Neural Network (CNN). Trained on both genuine human voice signals and synthesized voice signals, we show our approach achieves high accuracy on this classification task.

READ FULL TEXT

page 1

page 3

research
05/10/2019

Multiclass Language Identification using Deep Learning on Spectral Images of Audio Signals

The first step in any voice recognition software is to determine what la...
research
04/08/2019

Audio Classification of Bit-Representation Waveform

This paper investigates waveform representation for audio signal classif...
research
04/25/2023

AI-Synthesized Voice Detection Using Neural Vocoder Artifacts

Advancements in AI-synthesized human voices have created a growing threa...
research
02/18/2023

Exposing AI-Synthesized Human Voices Using Neural Vocoder Artifacts

The advancements of AI-synthesized human voices have introduced a growin...
research
12/02/2022

AccEar: Accelerometer Acoustic Eavesdropping with Unconstrained Vocabulary

With the increasing popularity of voice-based applications, acoustic eav...
research
02/14/2020

Acoustic Scene Classification Using Bilinear Pooling on Time-liked and Frequency-liked Convolution Neural Network

The current methodology in tackling Acoustic Scene Classification (ASC) ...
research
03/28/2022

Attacker Attribution of Audio Deepfakes

Deepfakes are synthetically generated media often devised with malicious...

Please sign up or login with your details

Forgot password? Click here to reset