Object Pose Estimation using Mid-level Visual Representations

03/02/2022
by Negar Nejatishahidin, et al.

This work proposes a novel pose estimation model for object categories that can be effectively transferred to previously unseen environments. Deep convolutional neural network (CNN) models for pose estimation are typically trained and evaluated on datasets specifically curated for object detection, pose estimation, or 3D reconstruction, which requires large amounts of training data. In this work, we propose a model for pose estimation that can be trained with a small amount of data and is built on top of generic mid-level representations <cit.> (e.g., surface normal estimation and re-shading). These representations are trained on a large dataset without requiring pose or object annotations. The predictions are then refined with a small CNN that exploits object masks and silhouette retrieval. The presented approach achieves superior performance on the Pix3D dataset <cit.> and shows nearly 35% improvement over existing models when only 25% of the training data is available. We show that the approach compares favorably in generalization and transfer to novel environments. To this end, we introduce a new pose estimation benchmark for commonly encountered furniture categories on the challenging Active Vision Dataset <cit.> and use it to evaluate models trained on the Pix3D dataset.


