Beyond One Glance: Gated Recurrent Architecture for Hand Segmentation

11/27/2018
by Wei Wang, et al.

As mixed reality gains momentum, effective and efficient solutions to egocentric hand segmentation are becoming critical. Traditional segmentation techniques typically follow a one-shot approach, in which the image is passed forward only once through a model that produces a segmentation mask. This strategy, however, does not reflect human perception, in which the representation of the world is continuously refined. In this paper, we therefore introduce a novel gated recurrent architecture. It goes beyond both iteratively passing the predicted segmentation mask through the network and adding a standard recurrent unit to it. Instead, it incorporates multiple encoder-decoder layers of the segmentation network, so as to keep track of its internal state in the refinement process. As evidenced by our results on standard hand segmentation benchmarks and on our own dataset, our approach outperforms these simpler recurrent segmentation techniques, as well as the state-of-the-art hand segmentation method. Furthermore, we demonstrate the generality of our approach by applying it to road segmentation, where it also outperforms baseline methods.
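The refinement loop described in the abstract can be pictured with a small toy sketch. It is not the paper's network: it assumes a standard GRU-style gate applied per pixel (equivalent to 1x1 convolutions), uses random untrained weights, and does not reproduce the multiple encoder-decoder layers the authors place inside their recurrent unit. The class name `ToyGatedRefiner` and all parameter names are hypothetical and chosen only for illustration.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


class ToyGatedRefiner:
    """Toy per-pixel gated recurrent refiner (illustrative assumption only)."""

    def __init__(self, in_channels, hidden_channels, seed=0):
        rng = np.random.default_rng(seed)
        c_in = in_channels + 1  # image channels plus the previous mask
        self.hidden_channels = hidden_channels

        def w(rows, cols):
            # Random, untrained weights; a real model would learn these.
            return rng.normal(scale=0.1, size=(rows, cols))

        self.W_z, self.U_z = w(c_in, hidden_channels), w(hidden_channels, hidden_channels)
        self.W_r, self.U_r = w(c_in, hidden_channels), w(hidden_channels, hidden_channels)
        self.W_h, self.U_h = w(c_in, hidden_channels), w(hidden_channels, hidden_channels)
        self.w_out = w(hidden_channels, 1)

    def step(self, image, mask, h):
        """One refinement step: read image + previous mask, update hidden state."""
        x = np.concatenate([image, mask], axis=-1)             # (H, W, C + 1)
        z = sigmoid(x @ self.W_z + h @ self.U_z)               # update gate
        r = sigmoid(x @ self.W_r + h @ self.U_r)               # reset gate
        h_tilde = np.tanh(x @ self.W_h + (r * h) @ self.U_h)   # candidate state
        h_new = (1.0 - z) * h + z * h_tilde                    # gated blend
        mask_new = sigmoid(h_new @ self.w_out)                 # (H, W, 1), values in (0, 1)
        return mask_new, h_new

    def refine(self, image, num_steps=3):
        """Run several passes, feeding each predicted mask back as input."""
        height, width, _ = image.shape
        mask = np.zeros((height, width, 1))
        h = np.zeros((height, width, self.hidden_channels))
        masks = []
        for _ in range(num_steps):
            mask, h = self.step(image, mask, h)
            masks.append(mask)
        return masks


image = np.random.default_rng(1).random((4, 5, 3))  # stand-in for an RGB frame
masks = ToyGatedRefiner(in_channels=3, hidden_channels=8).refine(image, num_steps=3)
```

The point of the sketch is the control flow: a one-shot model would call `step` once, whereas here both the predicted mask and the hidden state `h` are carried from one pass to the next, so later passes can revise earlier predictions.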


research
06/11/2019

Recurrent U-Net for Resource-Constrained Segmentation

State-of-the-art segmentation methods rely on very deep networks that ar...
research
11/16/2016

Convolutional Gated Recurrent Networks for Video Segmentation

Semantic segmentation has recently witnessed major progress, where fully...
research
06/01/2016

Recurrent Fully Convolutional Networks for Video Segmentation

Image segmentation is an important step in most visual tasks. While conv...
research
05/26/2023

Maskomaly: Zero-Shot Mask Anomaly Segmentation

We present a simple and practical framework for anomaly segmentation cal...
research
05/01/2018

Semantic Binary Segmentation using Convolutional Networks without Decoders

In this paper, we propose an efficient architecture for semantic image s...
research
09/08/2018

Simplified Hierarchical Recurrent Encoder-Decoder for Building End-To-End Dialogue Systems

As a generative model for building end-to-end dialogue systems, Hierarch...
research
07/05/2021

Are standard Object Segmentation models sufficient for Learning Affordance Segmentation?

Affordances are the possibilities of actions the environment offers to t...
