Online Generative-Discriminative Model for Object Detection in Video: An Unsupervised Learning Framework

11/12/2016
by   Dapeng Luo, et al.
0

Traditional single-view object detection methods often perform worse under unconstrained video environments. To address this problem, many modern multi-view detection approaches model complex 3D appearance representations to predict the optimal viewing angle for detection. Most of these approaches require an intensive training process on large database, collected in advance. In this paper, the proposed framework takes a remarkably different direction to resolve multi-view detection problem in a bottom-up fashion. First, a scene-specific objector is obtained from a fully autonomous learning process triggered by marking several bounding boxes around the object in the first video frame via a mouse. Here the human labeled training data or a generic detector are not needed. Second, this learning process is conveniently replicated many times in different surveillance scenes and results in a particular detector under various camera viewpoints. Thus, the proposed framework can be employed in multi-view object detection applications from unsupervised learning process. Obviously, the initial scene-specific detector, initialed by several bounding boxes, exhibits poor detection performance and is difficult to improve with traditional online learning algorithm. Consequently, we propose Generative-Discriminative model to partition detection response space and assign each partition an individual descriptor that progressively achieves high classification accuracy. A novel online gradual learning algorithm is proposed to train the Generative-Discriminative model automatically and focus online learning on the hard samples: the most informative samples lying around the decision boundary. The output is a hybrid classifier based scene-specific detector which achieves decent performance under different viewing angles.

READ FULL TEXT

page 2

page 5

page 8

page 9

page 10

research
09/09/2019

MLOD: A multi-view 3D object detection based on robust feature fusion method

This paper presents Multi-view Labelling Object Detector (MLOD). The det...
research
12/06/2018

Tube-CNN: Modeling temporal evolution of appearance for object detection in video

Object detection in video is crucial for many applications. Compared to ...
research
10/16/2019

Generative Modeling for Small-Data Object Detection

This paper explores object detection in the small data regime, where onl...
research
05/02/2021

RADDet: Range-Azimuth-Doppler based Radar Object Detection for Dynamic Road Users

Object detection using automotive radars has not been explored with deep...
research
06/30/2017

SMC Faster R-CNN: Toward a scene-specialized multi-object detector

Generally, the performance of a generic detector decreases significantly...
research
04/18/2017

Deep Self-Taught Learning for Weakly Supervised Object Localization

Most existing weakly supervised localization (WSL) approaches learn dete...
research
02/23/2023

A novel efficient Multi-view traffic-related object detection framework

With the rapid development of intelligent transportation system applicat...

Please sign up or login with your details

Forgot password? Click here to reset