Data Augmentation for Environmental Sound Classification Using Diffusion Probabilistic Model with Top-k Selection Discriminator

03/27/2023
by   Yunhao Chen, et al.
0

Despite consistent advancement in powerful deep learning techniques in recent years, large amounts of training data are still necessary for the models to avoid overfitting. Synthetic datasets using generative adversarial networks (GAN) have recently been generated to overcome this problem. Nevertheless, despite advancements, GAN-based methods are usually hard to train or fail to generate high-quality data samples. In this paper, we propose an environmental sound classification augmentation technique based on the diffusion probabilistic model with DPM-Solver++ for fast sampling. In addition, to ensure the quality of the generated spectrograms, we train a top-k selection discriminator on the dataset. According to the experiment results, the synthesized spectrograms have similar features to the original dataset and can significantly increase the classification accuracy of different state-of-the-art models compared with traditional data augmentation techniques. The public code is available on https://github.com/JNAIC/DPMs-for-Audio-Data-Augmentation.

READ FULL TEXT
research
01/12/2023

Diffusion-based Data Augmentation for Skin Disease Classification: Impact Across Original Medical Datasets to Fully Synthetic Images

Despite continued advancement in recent years, deep neural networks stil...
research
11/05/2022

Effective Audio Classification Network Based on Paired Inverse Pyramid Structure and Dense MLP Block

Recently, massive architectures based on Convolutional Neural Network (C...
research
10/18/2022

Deep Data Augmentation for Weed Recognition Enhancement: A Diffusion Probabilistic Model and Transfer Learning Based Approach

Weed management plays an important role in many modern agricultural appl...
research
08/29/2018

DADA: Deep Adversarial Data Augmentation for Extremely Low Data Regime Classification

Deep learning has revolutionized the performance of classification, but ...
research
11/18/2020

DeepNAG: Deep Non-Adversarial Gesture Generation

Synthetic data generation to improve classification performance (data au...
research
04/08/2019

Unsupervised Feature Learning for Environmental Sound Classification Using Cycle Consistent Generative Adversarial Network

In this paper we propose a novel environmental sound classification appr...
research
05/27/2023

Toward Understanding Generative Data Augmentation

Generative data augmentation, which scales datasets by obtaining fake la...

Please sign up or login with your details

Forgot password? Click here to reset