Attend and Segment: Attention Guided Active Semantic Segmentation

07/22/2020
by Soroush Seifi, et al.

In a dynamic environment, an agent with a limited field of view or limited resources cannot fully observe the scene before attempting to parse it. Deploying common semantic segmentation architectures is not feasible in such settings. In this paper we propose a method to gradually segment a scene given a sequence of partial observations. The main idea is to refine an agent's understanding of the environment by attending to the areas it is most uncertain about. Our method includes a self-supervised attention mechanism and a specialized architecture to maintain and exploit spatial memory maps for filling in the unseen areas of the environment. The agent can select and attend to an area while relying on cues from the visited areas to hallucinate the other parts. We reach a mean pixel-wise accuracy of 78.1 Kitti datasets by processing only 18 glimpses). We perform an ablation study on the number of glimpses, the input image size, and the effectiveness of retina-like glimpses. We compare our method to several baselines and show that the best results are achieved when the agent has access to a very low-resolution view of the scene at the first timestep.
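
The idea of attending to the most uncertain area can be illustrated with a minimal sketch. It is not the authors' self-supervised attention mechanism: it assumes per-pixel class probabilities are already available and uses Shannon entropy as a stand-in uncertainty measure. The names `pixel_entropy` and `next_glimpse`, the square-patch grid, and the `visited` set are all hypothetical and chosen only for illustration.

```python
import math


def pixel_entropy(probs):
    """Shannon entropy (in nats) of one pixel's class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def next_glimpse(prob_map, patch, visited):
    """Return the (row, col) index of the unvisited patch with the highest mean entropy.

    prob_map: H x W nested list; each entry is a list of class probabilities.
    patch:    side length of a square glimpse in pixels (assumed to divide H and W).
    visited:  set of (row, col) patch indices that were already attended.
    Returns None when every patch has been visited.
    """
    height, width = len(prob_map), len(prob_map[0])
    best, best_score = None, -1.0
    for pr in range(height // patch):
        for pc in range(width // patch):
            if (pr, pc) in visited:
                continue
            # Average the per-pixel entropy over this candidate patch.
            total = 0.0
            for y in range(pr * patch, (pr + 1) * patch):
                for x in range(pc * patch, (pc + 1) * patch):
                    total += pixel_entropy(prob_map[y][x])
            score = total / (patch * patch)
            if score > best_score:
                best, best_score = (pr, pc), score
    return best
```

In a loop, an agent of this kind would call `next_glimpse`, observe the chosen patch, update its segmentation and probability map, add the patch index to `visited`, and repeat until the glimpse budget is spent.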


