Detector Discovery in the Wild: Joint Multiple Instance and Representation Learning

12/02/2014
by   Judy Hoffman, et al.
0

We develop methods for detector learning which exploit joint training over both weak and strong labels and which transfer learned perceptual representations from strongly-labeled auxiliary tasks. Previous methods for weak-label learning often learn detector models independently using latent variable optimization, but fail to share deep representation knowledge across classes and usually require strong initialization. Other previous methods transfer deep representations from domains with strong labels to those with only weak labels, but do not optimize over individual latent boxes, and thus may miss specific salient structures for a particular category. We propose a model that subsumes these previous approaches, and simultaneously trains a representation and detectors for categories with either weak or strong labels present. We provide a novel formulation of a joint multiple instance learning method that includes examples from classification-style data when available, and also performs domain transfer learning to improve the underlying detector representation. Our model outperforms known methods on ImageNet-200 detection with weak labels.

READ FULL TEXT

page 1

page 3

page 6

page 7

page 8

research
02/19/2020

Strength from Weakness: Fast Learning Using Weak Supervision

We study generalization properties of weakly supervised learning. That i...
research
02/05/2020

Limitations of weak labels for embedding and tagging

While many datasets and approaches in ambient sound analysis use weakly ...
research
04/23/2019

Transferable Semi-supervised 3D Object Detection from RGB-D Data

We investigate the direction of training a 3D object detector for new ob...
research
05/14/2021

The Benefit Of Temporally-Strong Labels In Audio Event Classification

To reveal the importance of temporal precision in ground truth audio eve...
research
06/29/2021

Open-Set Representation Learning through Combinatorial Embedding

Visual recognition tasks are often limited to dealing with a small subse...
research
07/10/2020

Overcoming label noise in audio event detection using sequential labeling

This paper addresses the noisy label issue in audio event detection (AED...
research
03/24/2022

Rich Feature Construction for the Optimization-Generalization Dilemma

There often is a dilemma between ease of optimization and robust out-of-...

Please sign up or login with your details

Forgot password? Click here to reset