Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study

02/07/2022
by   Daniel Tompkins, et al.
0

An Xception model reaches state-of-the-art (SOTA) accuracy on the ESC-50 dataset for audio event detection through knowledge transfer from ImageNet weights, pretraining on AudioSet, and an on-the-fly data augmentation pipeline. This paper presents an ablation study that analyzes which components contribute to the boost in performance and training time. A smaller Xception model is also presented which nears SOTA performance with almost a third of the parameters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/02/2021

PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation

Audio event classification is an active research area and has a wide ran...
research
09/02/2023

Pretraining Representations for Bioacoustic Few-shot Detection using Supervised Contrastive Learning

Deep learning has been widely used recently for sound event detection an...
research
05/19/2021

Unsupervised Discriminative Learning of Sounds for Audio Event Classification

Recent progress in network-based audio event classification has shown th...
research
10/12/2021

Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection

Data augmentation methods have shown great importance in diverse supervi...
research
10/12/2022

Cross-dataset COVID-19 Transfer Learning with Cough Detection, Cough Segmentation, and Data Augmentation

This paper addresses issues on cough-based COVID-19 detection. We propos...
research
11/05/2022

Effective Audio Classification Network Based on Paired Inverse Pyramid Structure and Dense MLP Block

Recently, massive architectures based on Convolutional Neural Network (C...
research
08/10/2021

An empirical investigation into audio pipeline approaches for classifying bird species

This paper is an investigation into aspects of an audio classification p...

Please sign up or login with your details

Forgot password? Click here to reset