Learning Human Pose Models from Synthesized Data for Robust RGB-D Action Recognition

07/04/2017
by   Jian Liu, et al.
0

We propose Human Pose Models that represent RGB and depth images of human poses independent of clothing textures, backgrounds, lighting conditions, body shapes and camera viewpoints. Learning such universal models requires training images where all factors are varied for every human pose. Capturing such data is prohibitively expensive. Therefore, we develop a framework for synthesizing the training data. First, we learn representative human poses from a large corpus of real motion captured human skeleton data. Next, we fit synthetic 3D humans with different body shapes to each pose and render each from 180 camera viewpoints while randomly varying the clothing textures, background and lighting. Generative Adversarial Networks are employed to minimize the gap between synthetic and real image distributions. CNN models are then learned that transfer human poses to a shared high-level invariant space. The learned CNN models are then used as invariant feature extractors from real RGB and depth frames of human action videos and the temporal variations are modelled by Fourier Temporal Pyramid. Finally, linear SVM is used for classification. Experiments on three benchmark cross-view human action datasets show that our algorithm outperforms existing methods by significant margins for RGB only and RGB-D action recognition.

READ FULL TEXT

page 2

page 5

page 6

page 7

page 8

page 9

page 10

research
09/17/2021

Unsupervised View-Invariant Human Posture Representation

Most recent view-invariant action recognition and performance assessment...
research
12/14/2018

Action Machine: Rethinking Action Recognition in Trimmed Videos

Existing methods in video action recognition mostly do not distinguish h...
research
12/08/2019

View-invariant Deep Architecture for Human Action Recognition using late fusion

Human action Recognition for unknown views is a challenging task. We pro...
research
12/09/2019

Synthetic Humans for Action Recognition from Unseen Viewpoints

Our goal in this work is to improve the performance of human action reco...
research
12/16/2018

Human Pose and Path Estimation from Aerial Video using Dynamic Classifier Selection

We consider the problem of estimating human pose and trajectory by an ae...
research
10/23/2020

View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose

Recognition of human poses and activities is crucial for autonomous syst...
research
06/25/2021

Image-to-image Transformation with Auxiliary Condition

The performance of image recognition like human pose detection, trained ...

Please sign up or login with your details

Forgot password? Click here to reset