Unsupervised learning from video to detect foreground objects in single images

03/31/2017
by   Ioana Croitoru, et al.
0

Unsupervised learning from visual data is one of the most difficult challenges in computer vision, being a fundamental task for understanding how visual recognition works. From a practical point of view, learning from unsupervised visual input has an immense practical value, as very large quantities of unlabeled videos can be collected at low cost. In this paper, we address the task of unsupervised learning to detect and segment foreground objects in single images. We achieve our goal by training a student pathway, consisting of a deep neural network. It learns to predict from a single input image (a video frame) the output for that particular frame, of a teacher pathway that performs unsupervised object discovery in video. Our approach is different from the published literature that performs unsupervised discovery in videos or in collections of images at test time. We move the unsupervised discovery phase during the training stage, while at test time we apply the standard feed-forward processing along the student pathway. This has a dual benefit: firstly, it allows in principle unlimited possibilities of learning and generalization during training, while remaining very fast at testing. Secondly, the student not only becomes able to detect in single images significantly better than its unsupervised video discovery teacher, but it also achieves state of the art results on two important current benchmarks, YouTube Objects and Object Discovery datasets. Moreover, at test time, our system is at least two orders of magnitude faster than other previous methods.

READ FULL TEXT

page 6

page 7

page 8

page 9

page 10

page 11

research
08/14/2018

Unsupervised learning of foreground object detection

Unsupervised learning poses one of the most difficult challenges in comp...
research
04/05/2019

Unsupervised Image Matching and Object Discovery as Optimization

Learning with complete or partial supervision is powerful but relies on ...
research
03/25/2023

EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies

Detecting anomalies in images is an important task, especially in real-t...
research
04/06/2016

Learning to Track at 100 FPS with Deep Regression Networks

Machine learning techniques are often used in computer vision due to the...
research
06/01/2018

A Classification approach towards Unsupervised Learning of Visual Representations

In this paper, we present a technique for unsupervised learning of visua...
research
11/16/2016

Unsupervised Learning of Important Objects from First-Person Videos

A first-person camera, placed at a person's head, captures, which object...
research
05/12/2018

I Have Seen Enough: A Teacher Student Network for Video Classification Using Fewer Frames

Over the past few years, various tasks involving videos such as classifi...

Please sign up or login with your details

Forgot password? Click here to reset