The challenge of simultaneous object detection and pose estimation: a comparative study

01/24/2018
by   Daniel Oñoro-Rubio, et al.
0

Detecting objects and estimating their pose remains as one of the major challenges of the computer vision research community. There exists a compromise between localizing the objects and estimating their viewpoints. The detector ideally needs to be view-invariant, while the pose estimation process should be able to generalize towards the category-level. This work is an exploration of using deep learning models for solving both problems simultaneously. For doing so, we propose three novel deep learning architectures, which are able to perform a joint detection and pose estimation, where we gradually decouple the two tasks. We also investigate whether the pose estimation problem should be solved as a classification or regression problem, being this still an open question in the computer vision community. We detail a comparative analysis of all our solutions and the methods that currently define the state of the art for this problem. We use PASCAL3D+ and ObjectNet3D datasets to present the thorough experimental evaluation and main results. With the proposed models we achieve the state-of-the-art performance in both datasets.

READ FULL TEXT

page 1

page 6

page 8

page 11

page 13

page 14

research
12/22/2014

Convolutional Neural Networks for joint object detection and pose estimation: A comparative study

In this paper we study the application of convolutional neural networks ...
research
10/19/2022

MC-hands-1M: A glove-wearing hand dataset for pose estimation

Nowadays, the need for large amounts of carefully and complexly annotate...
research
07/28/2020

Accurate, Low-Latency Visual Perception for Autonomous Racing:Challenges, Mechanisms, and Practical Solutions

Autonomous racing provides the opportunity to test safety-critical perce...
research
06/07/2023

BU-CVKit: Extendable Computer Vision Framework for Species Independent Tracking and Analysis

A major bottleneck of interdisciplinary computer vision (CV) research is...
research
05/08/2018

A Mixed Classification-Regression Framework for 3D Pose Estimation from 2D Images

3D pose estimation from a single 2D image is an important and challengin...
research
04/01/2019

The RGB-D Triathlon: Towards Agile Visual Toolboxes for Robots

Deep networks have brought significant advances in robot perception, ena...
research
05/21/2021

Embracing New Techniques in Deep Learning for Estimating Image Memorability

Various work has suggested that the memorability of an image is consiste...

Please sign up or login with your details

Forgot password? Click here to reset