Mean Shift Mask Transformer for Unseen Object Instance Segmentation

11/21/2022
by Yangxiao Lu, et al.

Segmenting unseen objects is a critical task in many domains. For example, a robot that needs to grasp an unseen object must first visually separate that object from the background and from other objects. Mean shift clustering is a common method in object segmentation tasks. However, the traditional mean shift clustering algorithm is not differentiable, which makes it difficult to integrate into an end-to-end neural network training pipeline. In this work, we propose the Mean Shift Mask Transformer (MSMFormer), a new transformer architecture that simulates the von Mises-Fisher (vMF) mean shift clustering algorithm, allowing the feature extractor and the clustering to be trained and run jointly. Its central component is a hypersphere attention mechanism, which updates object queries on a hypersphere. To illustrate the effectiveness of our method, we apply MSMFormer to Unseen Object Instance Segmentation, where it achieves a new state-of-the-art Boundary F-measure of 87.3 on the real-world Object Clutter Indoor Dataset (OCID). Code is available at https://github.com/YoungSean/UnseenObjectsWithMeanShift
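For readers unfamiliar with the clustering step that MSMFormer simulates, the following is a minimal sketch of classical vMF mean shift on unit-length feature vectors. It is not the authors' implementation and contains no learned attention. The function names, the concentration value `kappa`, and the toy 2D data are assumptions chosen for illustration only.

```python
import math


def normalize(v):
    """Project a vector onto the unit hypersphere."""
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]


def vmf_mean_shift_step(queries, points, kappa):
    """One vMF mean shift update: each query moves to the normalized,
    kernel-weighted mean of the points, with weights exp(kappa * cosine)."""
    updated = []
    for q in queries:
        weights = [
            math.exp(kappa * sum(a * b for a, b in zip(q, p))) for p in points
        ]
        mean = [
            sum(w * p[d] for w, p in zip(weights, points))
            for d in range(len(q))
        ]
        updated.append(normalize(mean))
    return updated


def vmf_mean_shift(queries, points, kappa=20.0, num_iters=10):
    """Iterate the update; kappa and num_iters are illustrative defaults."""
    points = [normalize(p) for p in points]
    queries = [normalize(q) for q in queries]
    for _ in range(num_iters):
        queries = vmf_mean_shift_step(queries, points, kappa)
    return queries


# Toy data: two groups of 2D features, one near (1, 0) and one near (0, 1).
points = [[1.0, 0.05], [1.0, -0.05], [1.0, 0.0],
          [0.05, 1.0], [-0.05, 1.0], [0.0, 1.0]]
seeds = [[0.9, 0.2], [0.2, 0.9]]
modes = vmf_mean_shift(seeds, points)
```

Each seed converges toward the mode of the nearby group while staying on the unit sphere. The weighted-sum-then-normalize update has the same shape as an attention step over the features, which is the connection the abstract draws with hypersphere attention.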


