Comparing the Max and Noisy-Or Pooling Functions in Multiple Instance Learning for Weakly Supervised Sequence Learning Tasks

04/03/2018
by   Yun Wang, et al.
0

Many sequence learning tasks require the localization of certain events in sequences. Because it can be expensive to obtain strong labeling that specifies the starting and ending times of the events, modern systems are often trained with weak labeling without explicit timing information. Multiple instance learning (MIL) is a popular framework for learning from weak labeling. In a common scenario of MIL, it is necessary to choose a pooling function to aggregate the predictions for the individual steps of the sequences. In this paper, we compare the "max" and "noisy-or" pooling functions on a speech recognition task and a sound event detection task. We find that max pooling is able to localize phonemes and sound events, while noisy-or pooling fails. We provide a theoretical explanation of the different behavior of the two pooling functions on sequence learning tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/22/2018

A comparison of five multiple instance learning pooling functions for sound event detection with weak labeling

Sound event detection (SED) entails two subtasks: recognizing what types...
research
07/10/2022

Joint Analysis of Acoustic Scenes and Sound Events with Weakly labeled Data

Considering that acoustic scenes and sound events are closely related to...
research
10/20/2020

Power pooling: An adaptive pooling function for weakly labelled sound event detection

Access to large corpora with strongly labelled sound events is expensive...
research
05/24/2019

Specialized Decision Surface and Disentangled Feature for Weakly-Supervised Polyphonic Sound Event Detection

Sound event detection (SED) is to recognize the presence of sound events...
research
06/12/2021

Improving weakly supervised sound event detection with self-supervised auxiliary tasks

While multitask and transfer learning has shown to improve the performan...
research
10/22/2018

Connectionist Temporal Localization for Sound Event Detection with Sequential Labeling

Research on sound event detection (SED) with weak labeling has mostly fo...
research
07/21/2020

Guided multi-branch learning systems for DCASE 2020 Task 4

In this paper, we describe in detail our systems for DCASE 2020 Task 4. ...

Please sign up or login with your details

Forgot password? Click here to reset