AnyOKP: One-Shot and Instance-Aware Object Keypoint Extraction with Pretrained ViT

09/15/2023
by   Fangbo Qin, et al.

Towards flexible object-centric visual perception, we propose a one-shot instance-aware object keypoint (OKP) extraction approach, AnyOKP, which leverages the powerful representation ability of a pretrained vision transformer (ViT) and can obtain keypoints on multiple object instances of arbitrary category after learning from a single support image. An off-the-shelf pretrained ViT is directly deployed for generalizable and transferable feature extraction, followed by training-free feature enhancement. The best-prototype pairs (BPPs) are searched for in the support and query images based on appearance similarity, to yield instance-unaware candidate keypoints. Then, the entire graph, with all candidate keypoints as vertices, is divided into sub-graphs according to the feature distributions on the graph edges. Finally, each sub-graph represents an object instance. AnyOKP is evaluated on real object images collected with the cameras of a robot arm, a mobile robot, and a surgical robot; the results demonstrate not only cross-category flexibility and instance awareness, but also remarkable robustness to domain shift and viewpoint change.
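
The graph-division step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the feature distributions on the edges are compared, so the sketch assumes a hypothetical per-edge affinity score in [0, 1], drops edges below an assumed threshold, and treats each remaining connected component as one object instance. The function name `split_into_instances`, the `edge_affinity` dictionary, and the `threshold` parameter are all illustrative.

```python
def split_into_instances(num_keypoints, edge_affinity, threshold=0.5):
    """Group candidate keypoints into instances via connected components.

    edge_affinity maps (i, j) vertex pairs to a score in [0, 1].
    ASSUMPTION: this score is a hypothetical stand-in for the
    edge-feature comparison, which the abstract does not specify.
    """
    parent = list(range(num_keypoints))

    def find(x):
        # Follow parent links to the root, halving the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Keep only edges that look like "same instance" and merge their ends.
    for (i, j), score in edge_affinity.items():
        if score >= threshold:
            parent[find(i)] = find(j)

    # Collect vertices by root; each group is one sub-graph (instance).
    groups = {}
    for v in range(num_keypoints):
        groups.setdefault(find(v), []).append(v)
    return sorted(groups.values())


# Five candidate keypoints with made-up affinities for illustration.
affinity = {(0, 1): 0.9, (1, 2): 0.8, (2, 3): 0.1, (3, 4): 0.7, (0, 4): 0.2}
instances = split_into_instances(5, affinity)
print(instances)
```

With these made-up scores, keypoints 0 to 2 end up in one sub-graph and keypoints 3 and 4 in another. A plain threshold is the simplest possible division rule; any criterion that decides whether an edge joins two keypoints of the same instance could take its place.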

research
05/24/2022

OnePose: One-Shot Object Pose Estimation without CAD Models

We propose a new method named OnePose for object pose estimation. Unlike...
research
03/12/2021

FS-Net: Fast Shape-based Network for Category-Level 6D Object Pose Estimation with Decoupled Rotation Mechanism

In this paper, we focus on category-level 6D pose and size estimation fr...
research
10/07/2020

Contour Primitive of Interest Extraction Network Based on One-shot Learning for Object-Agnostic Vision Measurement

Image contour based vision measurement is widely applied in robot manipu...
research
03/11/2021

Unknown Object Segmentation from Stereo Images

Although instance-aware perception is a key prerequisite for many autono...
research
10/19/2022

Prompting through Prototype: A Prototype-based Prompt Learning on Pretrained Vision-Language Models

Prompt learning is a new learning paradigm which reformulates downstream...
research
05/07/2022

Category-Independent Articulated Object Tracking with Factor Graphs

Robots deployed in human-centric environments may need to manipulate a d...
research
07/22/2017

Single-Shot Clothing Category Recognition in Free-Configurations with Application to Autonomous Clothes Sorting

This paper proposes a single-shot approach for recognising clothing cate...