Discover and Learn New Objects from Documentaries

07/30/2017
by   Kai Chen, et al.
0

Despite the remarkable progress in recent years, detecting objects in a new context remains a challenging task. Detectors learned from a public dataset can only work with a fixed list of categories, while training from scratch usually requires a large amount of training data with detailed annotations. This work aims to explore a novel approach -- learning object detectors from documentary films in a weakly supervised manner. This is inspired by the observation that documentaries often provide dedicated exposition of certain object categories, where visual presentations are aligned with subtitles. We believe that object detectors can be learned from such a rich source of information. Towards this goal, we develop a joint probabilistic framework, where individual pieces of information, including video frames and subtitles, are brought together via both visual and linguistic links. On top of this formulation, we further derive a weakly supervised learning algorithm, where object model learning and training set mining are unified in an optimization procedure. Experimental results on a real world dataset demonstrate that this is an effective approach to learning new object detectors.

READ FULL TEXT

page 1

page 3

page 8

research
06/26/2020

Cross-Supervised Object Detection

After learning a new object category from image-level annotations (with ...
research
06/26/2020

Cross-Ssupervised Object Detection

After learning a new object category from image-level annotations (with ...
research
05/02/2017

Transfer Learning by Ranking for Weakly Supervised Object Annotation

Most existing approaches to training object detectors rely on fully supe...
research
10/04/2018

Unsupervised Adversarial Visual Level Domain Adaptation for Learning Video Object Detectors from Images

Deep learning based object detectors require thousands of diversified bo...
research
10/07/2021

Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions

We introduce the task of weakly supervised learning for detecting human ...
research
08/15/2016

Weakly Supervised Object Localization Using Size Estimates

We present a technique for weakly supervised object localization (WSOL),...
research
05/28/2019

SizeNet: Weakly Supervised Learning of Visual Size and Fit in Fashion Images

Finding clothes that fit is a hot topic in the e-commerce fashion indust...

Please sign up or login with your details

Forgot password? Click here to reset