H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions

04/10/2019
by Bugra Tekin, et al.

We present a unified framework for understanding 3D hand and object interactions in raw image sequences from egocentric RGB cameras. Given a single RGB image, our model jointly estimates the 3D hand and object poses, models their interactions, and recognizes the object and action classes in a single feed-forward pass through a neural network. We propose a single architecture that does not rely on external detection algorithms and is instead trained end-to-end on single images. We further merge and propagate information in the temporal domain to infer interactions between hand and object trajectories and to recognize actions. The complete model takes a sequence of frames as input and outputs per-frame 3D hand and object pose predictions, along with estimates of the object and action categories for the entire sequence. We demonstrate state-of-the-art performance, even compared with approaches that rely on depth data and ground-truth annotations.
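
The input/output contract described in the abstract (per-frame pose predictions plus one object label and one action label for the whole sequence) can be sketched roughly as follows. This is an illustrative outline only, not the authors' implementation: the names `FramePrediction`, `SequencePrediction`, and `recognize_sequence` are hypothetical, the per-frame network is passed in as an opaque callable, and plain score averaging stands in for the learned temporal model, whose details the abstract does not give.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Sequence, Tuple

Point3D = Tuple[float, float, float]


@dataclass
class FramePrediction:
    """Hypothetical per-frame output of a single feed-forward pass."""
    hand_pose: List[Point3D]    # 3D hand keypoints (number of points is an assumption)
    object_pose: List[Point3D]  # 3D object keypoints (number of points is an assumption)
    object_scores: List[float]  # one score per object class
    action_scores: List[float]  # one score per action class


@dataclass
class SequencePrediction:
    """Hypothetical sequence-level output: per-frame poses plus one label pair."""
    frames: List[FramePrediction]
    object_class: int
    action_class: int


def _mean_scores(rows: Sequence[Sequence[float]]) -> List[float]:
    # Column-wise mean of per-frame score vectors.
    return [sum(col) / len(rows) for col in zip(*rows)]


def _argmax(values: Sequence[float]) -> int:
    return max(range(len(values)), key=values.__getitem__)


def recognize_sequence(
    frames: Sequence[Any],
    predict_frame: Callable[[Any], FramePrediction],
) -> SequencePrediction:
    """Run the per-frame predictor on every frame, then aggregate over time.

    Averaging the class scores is an assumed stand-in for the paper's
    temporal model; it only illustrates the data flow.
    """
    if not frames:
        raise ValueError("expected at least one frame")
    per_frame = [predict_frame(frame) for frame in frames]
    object_mean = _mean_scores([p.object_scores for p in per_frame])
    action_mean = _mean_scores([p.action_scores for p in per_frame])
    return SequencePrediction(
        frames=per_frame,
        object_class=_argmax(object_mean),
        action_class=_argmax(action_mean),
    )
```

Here `predict_frame` would wrap whatever single-image network is available. Keeping it as a parameter means the sketch makes no claim about the network's internals.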

research
11/28/2016

Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition

We present a unified framework for understanding human social behaviors ...
research
03/27/2017

Trespassing the Boundaries: Labeling Temporal Bounds for Object Interactions in Egocentric Video

Manual annotations of temporal bounds for object interactions (i.e. star...
research
04/22/2021

H2O: Two Hands Manipulating Objects for First Person Interaction Recognition

We present, for the first time, a comprehensive framework for egocentric...
research
04/28/2020

Leveraging Photometric Consistency over Time for Sparsely Supervised Hand-Object Reconstruction

Modeling hand-object manipulations is essential for understanding how hu...
research
06/06/2015

Capturing Hands in Action using Discriminative Salient Points and Physics Simulation

Hand motion capture is a popular research field, recently gaining more a...
research
08/22/2019

Learning Object-Action Relations from Bimanual Human Demonstration Using Graph Networks

Recognising human actions is a vital task for a humanoid robot, especial...
research
12/17/2020

Reconstructing Hand-Object Interactions in the Wild

In this work we explore reconstructing hand-object interactions in the w...
