Entry-Flipped Transformer for Inference and Prediction of Participant Behavior

07/13/2022
by   Bo Hu, et al.
0

Some group activities, such as team sports and choreographed dances, involve closely coupled interaction between participants. Here we investigate the tasks of inferring and predicting participant behavior, in terms of motion paths and actions, under such conditions. We narrow the problem to that of estimating how a set target participants react to the behavior of other observed participants. Our key idea is to model the spatio-temporal relations among participants in a manner that is robust to error accumulation during frame-wise inference and prediction. We propose a novel Entry-Flipped Transformer (EF-Transformer), which models the relations of participants by attention mechanisms on both spatial and temporal domains. Unlike typical transformers, we tackle the problem of error accumulation by flipping the order of query, key, and value entries, to increase the importance and fidelity of observed features in the current frame. Comparative experiments show that our EF-Transformer achieves the best performance on a newly-collected tennis doubles dataset, a Ceilidh dance dataset, and two pedestrian datasets. Furthermore, it is also demonstrated that our EF-Transformer is better at limiting accumulated errors and recovering from wrong estimations.

READ FULL TEXT

page 10

page 24

page 25

page 26

page 27

research
05/18/2020

Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction

Understanding crowd motion dynamics is critical to real-world applicatio...
research
01/20/2023

Towards Robust Video Instance Segmentation with Temporal-Aware Transformer

Most existing transformer based video instance segmentation methods extr...
research
10/07/2022

Time-Space Transformers for Video Panoptic Segmentation

We propose a novel solution for the task of video panoptic segmentation,...
research
11/01/2021

Transformers for prompt-level EMA non-response prediction

Ecological Momentary Assessments (EMAs) are an important psychological d...
research
02/16/2023

Robust Human Motion Forecasting using Transformer-based Model

Comprehending human motion is a fundamental challenge for developing Hum...
research
03/16/2023

Predicting Human Attention using Computational Attention

Most models of visual attention are aimed at predicting either top-down ...

Please sign up or login with your details

Forgot password? Click here to reset