Deep progressive multi-scale attention for acoustic event classification

12/27/2019
by Xugang Lu, et al.

Convolutional neural networks (CNNs) are an indispensable building block of state-of-the-art systems for acoustic event classification (AEC). By stacking multiple CNN layers, a model can capture long-range dependencies in its top layers as feature abstraction increases. However, discriminative features with short-range dependency, which are distributed locally, may be smoothed out in the final representation. In this paper, we propose a progressive multi-scale attention (MSA) model that explicitly integrates multi-scale features with short- and long-range dependencies during feature extraction. Based on mathematical formulations, we show that the conventional residual CNN (ResCNN) model can be explained as a special case of the proposed MSA model, and that the MSA model can use the ResCNN as a backbone with attentive feature weighting across consecutive scales. The discriminative features at multiple scales are progressively propagated to the top layers for the final representation. The final representation therefore encodes multi-scale features with local and global discriminative structures, which is expected to improve performance. We tested the proposed model on two AEC data corpora: one for an urban acoustic event classification task, the other for acoustic event detection in smart car environments. Our results show that the proposed MSA model effectively improves on the performance of current state-of-the-art deep learning algorithms.
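The abstract's claim that a residual block can be viewed as a special case of attentive weighting across consecutive scales can be illustrated with a small sketch. This is not the paper's formulation, which is not given here. It is a toy construction under stated assumptions: the convolutional branch is replaced by a linear map with ReLU, the attention weights come from a hypothetical scoring vector `v`, and the weights are scaled to sum to 2 so that uniform attention reproduces the plain residual sum. The names `branch`, `residual_block`, and `attentive_block` are illustrative.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def branch(x, w):
    # Stand-in for a convolutional branch F(x): linear map followed by ReLU.
    # (Assumption: a real model would use convolution layers here.)
    return relu(x @ w)

def residual_block(x, w):
    # Plain residual connection: input plus transformed features.
    return x + branch(x, w)

def attentive_block(x, w, v):
    # x: (T, D) frame features, w: (D, D) branch weights,
    # v: (D,) hypothetical scoring vector used to weight the two scales.
    f = branch(x, w)
    # One score per frame for each of the two consecutive scales.
    scores = np.stack([x @ v, f @ v], axis=-1)            # shape (T, 2)
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    e = np.exp(scores)
    # Softmax scaled by 2, so uniform attention gives weights (1, 1).
    alpha = 2.0 * e / e.sum(axis=-1, keepdims=True)
    return alpha[:, :1] * x + alpha[:, 1:] * f

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 4))
w = rng.normal(size=(4, 4))

# With a zero scoring vector the attention is uniform, and the attentive
# block coincides with the residual block.
same = np.allclose(attentive_block(x, w, np.zeros(4)), residual_block(x, w))
```

In this toy setting, a non-zero `v` lets each frame emphasize either the lower-scale input or the transformed features, which is one simple way to read "attentive feature weighting in consecutive scales".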

research
05/31/2022

Progressive Multi-scale Consistent Network for Multi-class Fundus Lesion Segmentation

Effectively integrating multi-scale information is of considerable signi...
research
12/16/2022

DQnet: Cross-Model Detail Querying for Camouflaged Object Detection

Camouflaged objects are seamlessly blended in with their surroundings, w...
research
03/25/2018

Learning Environmental Sounds with Multi-scale Convolutional Neural Network

Deep learning has dramatically improved the performance of sounds recogn...
research
03/29/2023

PMAA: A Progressive Multi-scale Attention Autoencoder Model for High-Performance Cloud Removal from Multi-temporal Satellite Imagery

Satellite imagery analysis plays a vital role in remote sensing, but the...
research
10/03/2017

Event Identification as a Decision Process with Non-linear Representation of Text

We propose scale-free Identifier Network(sfIN), a novel model for event ...
research
11/29/2018

Multi-Scale Distributed Representation for Deep Learning and its Application to b-Jet Tagging

Recently machine learning algorithms based on deep layered artificial ne...
research
09/03/2021

Musical Tempo Estimation Using a Multi-scale Network

Recently, some single-step systems without onset detection have shown th...
