Bimodal SegNet: Instance Segmentation Fusing Events and RGB Frames for Robotic Grasping

03/20/2023
by   Sanket Kachole, et al.
0

Object segmentation for robotic grasping under dynamic conditions often faces challenges such as occlusion, low light conditions, motion blur and object size variance. To address these challenges, we propose a Deep Learning network that fuses two types of visual signals, event-based data and RGB frame data. The proposed Bimodal SegNet network has two distinct encoders, one for each signal input and a spatial pyramidal pooling with atrous convolutions. Encoders capture rich contextual information by pooling the concatenated features at different resolutions while the decoder obtains sharp object boundaries. The evaluation of the proposed method undertakes five unique image degradation challenges including occlusion, blur, brightness, trajectory and scale variance on the Event-based Segmentation (ESD) Dataset. The evaluation results show a 6-10% segmentation accuracy improvement over state-of-the-art methods in terms of mean intersection over the union and pixel accuracy. The model code is available at https://github.com/sanket0707/Bimodal-SegNet.git

READ FULL TEXT

page 1

page 4

page 5

page 7

research
05/05/2023

Asynchronous Events-based Panoptic Segmentation using Graph Mixer Neural Network

In the context of robotic grasping, object segmentation encounters sever...
research
12/07/2020

CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation

Video instance segmentation is a complex task in which we need to detect...
research
07/16/2020

Unseen Object Instance Segmentation for Robotic Environments

In order to function in unstructured environments, robots need the abili...
research
03/23/2021

Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers

Segmenting highly-overlapping objects is challenging, because typically ...
research
11/19/2022

HALSIE - Hybrid Approach to Learning Segmentation by Simultaneously Exploiting Image and Event Modalities

Standard frame-based algorithms fail to retrieve accurate segmentation m...
research
08/08/2023

SODFormer: Streaming Object Detection with Transformer Using Events and Frames

DAVIS camera, streaming two complementary sensing modalities of asynchro...
research
03/03/2022

E-CIR: Event-Enhanced Continuous Intensity Recovery

A camera begins to sense light the moment we press the shutter button. D...

Please sign up or login with your details

Forgot password? Click here to reset