Deep Reinforcement Learning for Active Human Pose Estimation

01/07/2020
by   Erik Gärtner, et al.

Most 3D human pose estimation methods assume that the input – be it images of a scene collected from one or several viewpoints, or a video – is given. Consequently, they focus on estimates leveraging prior knowledge and measurement by fusing information spatially and/or temporally, whenever available. In this paper we address the problem of an active observer with freedom to move and explore the scene spatially – in 'time-freeze' mode – and/or temporally, by selecting informative viewpoints that improve its estimation accuracy. Towards this end, we introduce Pose-DRL, a fully trainable deep reinforcement learning-based active pose estimation architecture which learns to select appropriate views, in space and time, to feed an underlying monocular pose estimator. We evaluate our model using single- and multi-target estimators, with strong results in both settings. Our system further learns automatic stopping conditions in time and transition functions to the next temporal processing step in videos. In extensive experiments with the Panoptic multi-view setup, and for complex scenes containing multiple people, we show that our model learns to select viewpoints that yield significantly more accurate pose estimates compared to strong multi-view baselines.
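The active-observer loop described in the abstract can be illustrated with a toy sketch. This is not Pose-DRL: every name here (`noisy_estimate`, `fuse`, `active_view_selection`, `camera_noise`, `stop_tol`) is hypothetical. The view-selection policy is a random placeholder where the paper uses a learned one, the monocular estimator is simulated as ground truth plus Gaussian noise, and the stopping rule is a hand-set threshold rather than a learned condition.

```python
import random


def noisy_estimate(true_pose, noise_std, rng):
    """Stand-in for a monocular pose estimator: true joints plus Gaussian noise."""
    return [[c + rng.gauss(0.0, noise_std) for c in joint] for joint in true_pose]


def fuse(estimates):
    """Fuse per-view estimates by averaging each joint coordinate."""
    n = len(estimates)
    return [[sum(e[j][k] for e in estimates) / n for k in range(3)]
            for j in range(len(estimates[0]))]


def mean_joint_error(pose_a, pose_b):
    """Mean Euclidean distance between corresponding joints."""
    dists = [sum((a - b) ** 2 for a, b in zip(ja, jb)) ** 0.5
             for ja, jb in zip(pose_a, pose_b)]
    return sum(dists) / len(dists)


def active_view_selection(true_pose, camera_noise, max_views, stop_tol, rng):
    """Select views one at a time, fuse them, and stop once the fused pose stabilises."""
    unvisited = list(range(len(camera_noise)))
    estimates, visited, fused = [], [], None
    while unvisited and len(visited) < max_views:
        # Placeholder policy (assumption): pick a random unvisited camera.
        cam = rng.choice(unvisited)
        unvisited.remove(cam)
        visited.append(cam)
        estimates.append(noisy_estimate(true_pose, camera_noise[cam], rng))
        new_fused = fuse(estimates)
        # Hand-set stopping rule (assumption): stop when the new view
        # moves the fused pose by less than stop_tol on average.
        converged = fused is not None and mean_joint_error(fused, new_fused) < stop_tol
        fused = new_fused
        if converged:
            break
    return fused, visited


if __name__ == "__main__":
    rng = random.Random(0)
    pose = [[0.0, 1.0, 2.0], [1.0, 1.0, 1.0]]  # two joints, (x, y, z)
    fused, visited = active_view_selection(
        pose, camera_noise=[0.05, 0.2, 0.1, 0.3], max_views=4, stop_tol=0.02, rng=rng)
    print("visited cameras:", visited)
    print("error vs. ground truth:", mean_joint_error(fused, pose))
```

Replacing `rng.choice` with a policy that scores candidate cameras is where a learned agent would plug in; the rest of the loop only handles fusion and termination.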


research
02/07/2019

3D Human Pose Estimation from Deep Multi-View 2D Pose

Human pose estimation - the process of recognizing a human's limb positi...
research
05/19/2019

Geometric Pose Affordance: 3D Human Pose with Scene Constraints

Full 3D estimation of human pose from a single image remains a challengi...
research
11/23/2022

Unsupervised 3D Keypoint Estimation with Multi-View Geometry

Given enough annotated training data, 3D human pose estimation models ca...
research
12/09/2019

DeepFuse: An IMU-Aware Network for Real-Time 3D Human Pose Estimation from Multi-View Image

In this paper, we propose a two-stage fully 3D network, namely DeepFuse,...
research
12/02/2020

Unsupervised Learning on Monocular Videos for 3D Human Pose Estimation

In this paper, we introduce an unsupervised feature extraction method th...
research
10/11/2021

Adaptively Multi-view and Temporal Fusing Transformer for 3D Human Pose Estimation

In practical application, 3D Human Pose Estimation (HPE) is facing with ...
research
10/19/2019

Active 6D Multi-Object Pose Estimation in Cluttered Scenarios with Deep Reinforcement Learning

In this work, we explore how a strategic selection of camera movements c...
