Deep Learning on Monocular Object Pose Detection and Tracking: A Comprehensive Overview

05/29/2021
by Zhaoxin Fan, et al.

Object pose detection and tracking have recently attracted increasing attention because of their wide applications in areas such as autonomous driving, robotics, and augmented reality. Among the methods for object pose detection and tracking, deep learning is the most promising and has shown better performance than the alternatives. However, a survey of the latest developments in deep learning-based methods is lacking. Therefore, this paper presents a comprehensive review of recent progress in deep learning-based object pose detection and tracking. To allow a more thorough treatment, the scope of this paper is limited to methods that take monocular RGB/RGBD data as input, covering three major tasks: instance-level monocular object pose detection, category-level monocular object pose detection, and monocular object pose tracking. Metrics, datasets, and methods for both detection and tracking are presented in detail. Comparative results of current state-of-the-art methods on several publicly available datasets are also presented, together with insightful observations and promising directions for future research.

