Weakly-Supervised Semantic Segmentation using Motion Cues

03/23/2016
by   Pavel Tokmakov, et al.
0

Fully convolutional neural networks (FCNNs) trained on a large number of images with strong pixel-level annotations have become the new state of the art for the semantic segmentation task. While there have been recent attempts to learn FCNNs from image-level weak annotations, they need additional constraints, such as the size of an object, to obtain reasonable performance. To address this issue, we present motion-CNN (M-CNN), a novel FCNN framework which incorporates motion cues and is learned from video-level weak annotations. Our learning scheme to train the network uses motion segments as soft constraints, thereby handling noisy motion information. When trained on weakly-annotated videos, our method outperforms the state-of-the-art EM-Adapt approach on the PASCAL VOC 2012 image segmentation benchmark. We also demonstrate that the performance of M-CNN learned with 150 weak video annotations is on par with state-of-the-art weakly-supervised methods trained with thousands of images. Finally, M-CNN substantially outperforms recent approaches in a related task of video co-localization on the YouTube-Objects dataset.

READ FULL TEXT

page 1

page 3

page 4

page 9

page 11

research
10/04/2017

Learning to Segment Human by Watching YouTube

An intuition on human segmentation is that when a human is moving in a v...
research
09/02/2016

Built-in Foreground/Background Prior for Weakly-Supervised Semantic Segmentation

Pixel-level annotations are expensive and time consuming to obtain. Henc...
research
06/09/2019

Movable-Object-Aware Visual SLAM via Weakly Supervised Semantic Segmentation

Moving objects can greatly jeopardize the performance of a visual simult...
research
12/07/2016

Bottom-Up Top-Down Cues for Weakly-Supervised Semantic Segmentation

We consider the task of learning a classifier for semantic segmentation ...
research
02/14/2022

Box Supervised Video Segmentation Proposal Network

Video Object Segmentation (VOS) has been targeted by various fully-super...
research
08/15/2019

Discretely-constrained deep network for weakly supervised segmentation

An efficient strategy for weakly-supervised segmentation is to impose co...
research
08/15/2017

Bringing Background into the Foreground: Making All Classes Equal in Weakly-supervised Video Semantic Segmentation

Pixel-level annotations are expensive and time-consuming to obtain. Henc...

Please sign up or login with your details

Forgot password? Click here to reset