Learning discriminative and robust time-frequency representations for environmental sound classification

12/14/2019
by Helin Wang, et al.

Convolutional neural networks (CNNs) are among the best-performing neural network architectures for environmental sound classification (ESC). Recently, attention mechanisms have been used in CNNs to capture useful information from the audio signal for sound classification, especially for weakly labelled data, where the training data provide sound class labels but no timing information about the acoustic events. These methods, however, do not explicitly exploit the inherent time-frequency characteristics and variations when obtaining the deep features. In this paper, we propose a new method, called the time-frequency enhancement block (TFBlock), in which temporal attention and frequency attention are employed to enhance the features from relevant frames and frequency bands. Unlike other attention mechanisms, our method constructs parallel branches that allow the temporal and frequency features to be attended separately, in order to mitigate interference from sections where no sound events occur in the acoustic environment. Experiments on three benchmark ESC datasets show that our method improves classification performance and also exhibits robustness to noise.
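The idea of attending to frames and frequency bands in parallel branches can be illustrated with a small NumPy sketch. This is not the TFBlock from the paper: the abstract does not specify the layers, so the function name `tf_enhance`, the mean pooling, the sigmoid gating, the scalar gains `w_time` and `w_freq`, and the residual sum used to combine the branches are all assumptions made for illustration only.

```python
import numpy as np


def sigmoid(x):
    """Logistic function, used here to squash scores into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def tf_enhance(features, w_time=1.0, w_freq=1.0):
    """Illustrative parallel temporal/frequency attention (hypothetical).

    features: array of shape (channels, frames, bands).
    w_time, w_freq: hypothetical scalar gains standing in for learned weights.
    Returns the enhanced features and the two attention vectors.
    """
    # Temporal branch: one score per frame, pooled over channels and bands.
    frame_scores = features.mean(axis=(0, 2))                  # shape (frames,)
    time_att = sigmoid(w_time * (frame_scores - frame_scores.mean()))
    time_branch = features * time_att[None, :, None]

    # Frequency branch: one score per band, pooled over channels and frames.
    band_scores = features.mean(axis=(0, 1))                   # shape (bands,)
    freq_att = sigmoid(w_freq * (band_scores - band_scores.mean()))
    freq_branch = features * freq_att[None, None, :]

    # Assumed combination: residual sum of the input and both branches.
    enhanced = features + time_branch + freq_branch
    return enhanced, time_att, freq_att


if __name__ == "__main__":
    # Toy input: 2 channels, 4 frames, 3 bands, with energy only in frame 1.
    x = np.zeros((2, 4, 3))
    x[:, 1, :] = 1.0
    enhanced, time_att, freq_att = tf_enhance(x)
    print("temporal attention:", np.round(time_att, 3))
    print("frequency attention:", np.round(freq_att, 3))
```

In this toy input the active frame receives the largest temporal weight, while the frequency weights stay uniform because every band carries the same energy. A trained block would replace the fixed pooling and scalar gains with learned layers.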
