A Comparison of Pooling Methods on LSTM Models for Rare Acoustic Event Classification

02/14/2020
by   Chieh-Chi Kao, et al.
0

Acoustic event classification (AEC) and acoustic event detection (AED) refer to the task of detecting whether specific target events occur in audios. As long short-term memory (LSTM) leads to state-of-the-art results in various speech related tasks, it is employed as a popular solution for AEC as well. This paper focuses on investigating the dynamics of LSTM model on AEC tasks. It includes a detailed analysis on LSTM memory retaining, and a benchmarking of nine different pooling methods on LSTM models using 1.7M generated mixture clips of multiple events with different signal-to-noise ratios. This paper focuses on understanding: 1) utterance-level classification accuracy; 2) sensitivity to event position within an utterance. The analysis is done on the dataset for the detection of rare sound events from DCASE 2017 Challenge. We find max pooling on the prediction level to perform the best among the nine pooling approaches in terms of classification accuracy and insensitivity to event position within an utterance. To authors' best knowledge, this is the first kind of such work focused on LSTM dynamics for AEC tasks.

READ FULL TEXT
research
08/20/2018

A simple model for detection of rare sound events

We propose a simple recurrent model for detecting rare sound events, whe...
research
07/10/2022

Joint Analysis of Acoustic Scenes and Sound Events with Weakly labeled Data

Considering that acoustic scenes and sound events are closely related to...
research
09/20/2018

LSTM-based Whisper Detection

This article presents a whisper speech detector in the far-field domain....
research
04/29/2019

Semi-supervised Acoustic Event Detection based on tri-training

This paper presents our work of training acoustic event detection (AED) ...
research
02/12/2021

Transformer Language Models with LSTM-based Cross-utterance Information Representation

The effective incorporation of cross-utterance information has the poten...
research
09/30/2017

Fine-grained Event Learning of Human-Object Interaction with LSTM-CRF

Event learning is one of the most important problems in AI. However, not...
research
08/26/2022

Static Seeding and Clustering of LSTM Embeddings to Learn from Loosely Time-Decoupled Events

Humans learn from the occurrence of events in a different place and time...

Please sign up or login with your details

Forgot password? Click here to reset