ACNet: Attention Based Network to Exploit Complementary Features for RGBD Semantic Segmentation

05/24/2019
by Xinxin Hu, et al.

Compared to RGB semantic segmentation, RGBD semantic segmentation can achieve better performance by taking depth information into account. However, contemporary segmenters still struggle to exploit RGBD information effectively, since the feature distributions of RGB and depth (D) images vary significantly across scenes. In this paper, we propose an Attention Complementary Network (ACNet) that selectively gathers features from the RGB and depth branches. The main contributions are the Attention Complementary Module (ACM) and an architecture with three parallel branches. More precisely, ACM is a channel attention-based module that extracts weighted features from the RGB and depth branches. The architecture preserves the inference of the original RGB and depth branches while enabling the fusion branch at the same time. Based on these structures, ACNet is capable of exploiting more high-quality features from different channels. We evaluate our model on the SUN-RGBD and NYUDv2 datasets and show that it outperforms state-of-the-art methods. In particular, an mIoU score of 48.3% is achieved on the NYUDv2 test set with ResNet50. We will release our PyTorch-based source code and the trained segmentation model at https://github.com/anheidelonghu/ACNet.
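The abstract describes ACM only as a channel attention-based module that weights RGB and depth features. As a rough illustration, the sketch below assumes a squeeze-and-excitation-style recipe: global average pooling per channel, a per-channel linear map standing in for a 1x1 convolution, a sigmoid gate, and an element-wise sum of the two reweighted branches. These steps, and the names `global_avg_pool`, `channel_attention`, and `fuse`, are assumptions for illustration and not the authors' released PyTorch code; the sketch uses plain Python lists so it runs without dependencies.

```python
import math


def global_avg_pool(feat):
    """Average each channel of a C x H x W nested list into one scalar."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in feat]


def channel_attention(feat, weight, bias):
    """Scale each channel by a sigmoid gate computed from pooled statistics.

    weight is a C x C matrix (assumed stand-in for a 1x1 convolution),
    bias is a length-C list. Returns (reweighted features, gates).
    """
    pooled = global_avg_pool(feat)
    gates = []
    for w_row, b in zip(weight, bias):
        z = sum(w * p for w, p in zip(w_row, pooled)) + b
        gates.append(1.0 / (1.0 + math.exp(-z)))  # sigmoid keeps gates in (0, 1)
    scaled = [[[g * v for v in row] for row in ch] for g, ch in zip(gates, feat)]
    return scaled, gates


def fuse(rgb_feat, depth_feat, rgb_params, depth_params):
    """Reweight both branches, then sum them element-wise (assumed fusion rule)."""
    rgb_scaled, _ = channel_attention(rgb_feat, *rgb_params)
    depth_scaled, _ = channel_attention(depth_feat, *depth_params)
    return [
        [[a + b for a, b in zip(row_a, row_b)] for row_a, row_b in zip(ch_a, ch_b)]
        for ch_a, ch_b in zip(rgb_scaled, depth_scaled)
    ]


# Toy inputs: 2 channels of 2 x 2 values per modality.
rgb = [[[2.0, 2.0], [2.0, 2.0]], [[4.0, 4.0], [4.0, 4.0]]]
depth = [[[6.0, 6.0], [6.0, 6.0]], [[8.0, 8.0], [8.0, 8.0]]]
# Zero weights and biases make every gate exactly sigmoid(0) = 0.5.
zero_params = ([[0.0, 0.0], [0.0, 0.0]], [0.0, 0.0])
fused = fuse(rgb, depth, zero_params, zero_params)
```

With the zero parameters used here every gate is 0.5, so the fused map is simply the average of the two inputs; learned weights would instead let each modality emphasize its more informative channels before fusion.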

research
12/25/2019

Multi-Modal Attention-based Fusion Model for Semantic Segmentation of RGB-Depth Images

The 3D scene understanding is mainly considered as a crucial requirement...
research
10/18/2021

FEANet: Feature-Enhanced Attention Network for RGB-Thermal Real-time Semantic Segmentation

The RGB-Thermal (RGB-T) information for semantic segmentation has been e...
research
10/26/2022

RGB-T Semantic Segmentation with Location, Activation, and Sharpening

Semantic segmentation is important for scene understanding. To address t...
research
11/08/2022

DepthFormer: Multimodal Positional Encodings and Cross-Input Attention for Transformer-Based Segmentation Networks

Most approaches for semantic segmentation use only information from colo...
research
04/11/2017

Quality Aware Network for Set to Set Recognition

This paper targets on the problem of set to set recognition, which learn...
research
03/31/2018

FloorNet: A Unified Framework for Floorplan Reconstruction from 3D Scans

The ultimate goal of this indoor mapping research is to automatically re...
research
09/04/2023

AGG-Net: Attention Guided Gated-convolutional Network for Depth Image Completion

Recently, stereo vision based on lightweight RGBD cameras has been widel...
