U2-ONet: A Two-level Nested Octave U-structure with Multiscale Attention Mechanism for Moving Instances Segmentation

07/26/2020
by Chenjie Wang, et al.

Most scenes in practical applications are dynamic and contain moving objects, so accurately segmenting moving objects is crucial for many computer vision applications. To efficiently segment all moving objects in a scene, regardless of whether an object has a predefined semantic label, we propose U2-ONet, a two-level nested Octave U-structure network with a multiscale attention mechanism. Each stage of U2-ONet is filled with our newly designed Octave ReSidual U-block (ORSU), which captures more context information at different scales while reducing the spatial redundancy of feature maps. To train this multiscale deep network efficiently, we introduce a hierarchical training supervision strategy that calculates the loss at each level and adds a knowledge matching loss to keep the optimization consistent. Experimental results show that our method achieves state-of-the-art performance on several general moving object segmentation datasets.
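The hierarchical supervision idea can be illustrated with a minimal sketch. The abstract does not give the exact form of the losses, so everything below is an assumption made for illustration: binary cross-entropy as the per-level loss, a mean squared difference between each side output and the fused output as a stand-in for the knowledge matching term, and the hypothetical names `bce`, `matching_term`, `hierarchical_loss`, and `match_weight`. It is not the authors' implementation.

```python
import math


def bce(pred, target, eps=1e-7):
    """Mean binary cross-entropy between flat lists of probabilities and labels."""
    total = 0.0
    for p, t in zip(pred, target):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(t * math.log(p) + (1.0 - t) * math.log(1.0 - p))
    return total / len(pred)


def matching_term(side, fused):
    """Assumed consistency term: mean squared difference between a
    side output and the fused output (a placeholder, not the paper's loss)."""
    return sum((s - f) ** 2 for s, f in zip(side, fused)) / len(side)


def hierarchical_loss(side_outputs, fused, target, match_weight=0.1):
    """Sum a loss at every level, add the fused-output loss, then add a
    weighted consistency term tying each level to the fused prediction."""
    level_losses = sum(bce(s, target) for s in side_outputs)
    fused_loss = bce(fused, target)
    match = sum(matching_term(s, fused) for s in side_outputs)
    return level_losses + fused_loss + match_weight * match


if __name__ == "__main__":
    target = [1.0, 0.0, 1.0, 0.0]       # toy flattened mask
    fused = [0.9, 0.1, 0.8, 0.2]        # toy fused prediction
    sides = [[0.7, 0.3, 0.6, 0.4],      # toy coarse-level prediction
             [0.85, 0.15, 0.75, 0.25]]  # toy fine-level prediction
    print(hierarchical_loss(sides, fused, target))
```

In this sketch the consistency term vanishes when every level agrees with the fused output, so it only penalizes disagreement between levels; the weight `match_weight` is an arbitrary illustrative value.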
