Instance-level loss based multiple-instance learning for acoustic scene classification

03/16/2022
by Won-Gook Choi, et al.

In the acoustic scene classification (ASC) task, an acoustic scene consists of diverse attributes and is inferred by identifying combinations of distinct attributes among them. This study aims to extract and cluster these attributes effectively using a multiple-instance learning (MIL) framework for ASC. MIL, a weakly supervised learning method, extracts instances from the input data and infers the scene corresponding to that input from those unlabeled instances. We develop an MIL framework better suited to ASC systems by adopting instance-level labels and an instance-level loss, which are effective in extracting and clustering instances. As a result, the witness rate increases significantly compared to the framework without instance-level loss and labels. In several MIL-based ASC systems, the classification accuracy also improves by about 5 to 11%. We also designed a fully separated convolutional module, a low-complexity neural network consisting of pointwise, frequency-sided depthwise, and temporal-sided depthwise convolutional filters. Considering both complexity and performance, our proposed system is more practical than previous systems on the DCASE 2019 challenge task 1-A leaderboard. We surpassed the third-place model, achieving a performance of 82.3% with a model complexity of only 417K, which is at least 40 times lower than that of other systems.
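To make the idea of combining a bag-level loss with an instance-level loss more concrete, here is a minimal pure-Python sketch. It is not the authors' implementation. It assumes that each instance simply inherits the scene label of its bag, that the bag prediction is the mean of the instance probabilities, and that the witness rate can be approximated by the fraction of instances whose top class matches the bag label. The function name `mil_losses`, the example logits, and the loss weight of 1.0 are all hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Negative log-likelihood of the true class."""
    return -math.log(probs[label])


def mil_losses(instance_logits, bag_label):
    """Return (bag_loss, instance_loss, bag_probs, witness_rate) for one bag.

    instance_logits: one list of class logits per instance in the bag.
    bag_label: index of the scene class assigned to the whole bag.
    """
    instance_probs = [softmax(row) for row in instance_logits]
    n = len(instance_probs)
    num_classes = len(instance_probs[0])

    # Bag-level prediction: mean pooling of instance probabilities
    # (an assumed aggregator; other pooling choices are possible).
    bag_probs = [sum(p[c] for p in instance_probs) / n
                 for c in range(num_classes)]
    bag_loss = cross_entropy(bag_probs, bag_label)

    # Instance-level loss: every instance inherits the bag label
    # (an assumption made for this sketch only).
    instance_loss = sum(cross_entropy(p, bag_label)
                        for p in instance_probs) / n

    # Witness-rate proxy: share of instances whose top class is the bag label.
    witnesses = sum(
        1 for p in instance_probs
        if max(range(num_classes), key=p.__getitem__) == bag_label
    )
    return bag_loss, instance_loss, bag_probs, witnesses / n


# Hypothetical bag with four instances and three scene classes.
logits = [[2.0, 0.1, -1.0],
          [1.5, 0.3, 0.0],
          [0.2, 1.0, 0.1],
          [3.0, -0.5, 0.2]]
bag_loss, inst_loss, bag_probs, witness_rate = mil_losses(logits, bag_label=0)
total_loss = bag_loss + 1.0 * inst_loss  # loss weight of 1.0 is an assumption
```

In this toy bag, three of the four instances favor class 0, so the witness-rate proxy is 0.75. Training on the instance-level term pushes the remaining instance toward the bag label as well, which is one plausible reading of why an instance-level loss would raise the witness rate.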


research
07/25/2020

DD-CNN: Depthwise Disout Convolutional Neural Network for Low-complexity Acoustic Scene Classification

This paper presents a Depthwise Disout Convolutional Neural Network (DD-...
research
07/03/2021

A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification

We propose a novel neural model compression strategy combining data augm...
research
04/10/2019

Acoustic Scene Classification by Implicitly Identifying Distinct Sound Events

In this paper, we propose a new strategy for acoustic scene classificati...
research
06/24/2022

Domain Generalization with Relaxed Instance Frequency-wise Normalization for Multi-device Acoustic Scene Classification

While using two-dimensional convolutional neural networks (2D-CNNs) in i...
research
06/25/2016

Label Tree Embeddings for Acoustic Scene Classification

We present in this paper an efficient approach for acoustic scene classi...
research
06/22/2022

Feature Re-calibration based MIL for Whole Slide Image Classification

Whole slide image (WSI) classification is a fundamental task for the dia...
research
04/15/2021

Attentive Max Feature Map for Acoustic Scene Classification with Joint Learning considering the Abstraction of Classes

The attention mechanism has been widely adopted in acoustic scene classi...
