Uncertainty-aware State Space Transformer for Egocentric 3D Hand Trajectory Forecasting

07/17/2023
by   Wentao Bao, et al.
0

Hand trajectory forecasting from egocentric views is crucial for enabling a prompt understanding of human intentions when interacting with AR/VR systems. However, existing methods handle this problem in a 2D image space which is inadequate for 3D real-world applications. In this paper, we set up an egocentric 3D hand trajectory forecasting task that aims to predict hand trajectories in a 3D space from early observed RGB videos in a first-person view. To fulfill this goal, we propose an uncertainty-aware state space Transformer (USST) that takes the merits of the attention mechanism and aleatoric uncertainty within the framework of the classical state-space model. The model can be further enhanced by the velocity constraint and visual prompt tuning (VPT) on large vision transformers. Moreover, we develop an annotation workflow to collect 3D hand trajectories with high quality. Experimental results on H2O and EgoPAT3D datasets demonstrate the superiority of USST for both 2D and 3D trajectory forecasting. The code and datasets are publicly released: https://github.com/Cogito2012/USST.

READ FULL TEXT

page 1

page 8

page 12

page 15

page 16

research
05/01/2020

Multi-Camera Trajectory Forecasting: Pedestrian Trajectory Prediction in a Network of Cameras

We introduce the task of multi-camera trajectory forecasting (MCTF), whe...
research
11/29/2021

A white-boxed ISSM approach to estimate uncertainty distributions of Walmart sales

We present our solution for the M5 Forecasting - Uncertainty competition...
research
03/18/2020

Transformer Networks for Trajectory Forecasting

Most recent successes on forecasting the people motion are based on LSTM...
research
02/03/2022

Trajectory Forecasting from Detection with Uncertainty-Aware Motion Encoding

Trajectory forecasting is critical for autonomous platforms to make safe...
research
10/25/2022

Clinically-Inspired Multi-Agent Transformers for Disease Trajectory Forecasting from Multimodal Data

Deep neural networks are often applied to medical images to automate the...
research
05/08/2022

Mutual Distillation Learning Network for Trajectory-User Linking

Trajectory-User Linking (TUL), which links trajectories to users who gen...
research
08/10/2021

Multi-Camera Trajectory Forecasting with Trajectory Tensors

We introduce the problem of multi-camera trajectory forecasting (MCTF), ...

Please sign up or login with your details

Forgot password? Click here to reset