Learning Sparse Temporal Video Mapping for Action Quality Assessment in Floor Gymnastics

01/15/2023
by   Sania Zahan, et al.

Athlete performance measurement in sports videos requires modeling long sequences, since the entire spatio-temporal progression contributes to the performance. Accurate evaluation depends on capturing both local discriminative spatial dependencies and global semantics. However, existing benchmark datasets mainly cover sports in which a performance lasts only a few seconds. Consequently, state-of-the-art sports quality assessment methods focus specifically on spatial structure. Although they achieve high performance on short-term sports, they cannot model prolonged video sequences and fail to achieve similar performance on long-term sports. To facilitate such analysis, we introduce a new dataset, coined AGF-Olympics, that incorporates artistic gymnastic floor routines. AGF-Olympics provides highly challenging scenarios with extensive background, viewpoint, and scale variations over an extended sample duration of up to 2 minutes. In addition, we propose a discriminative attention module that maps the dense feature space into a sparse representation by disentangling complex associations. Extensive experiments indicate that the proposed module provides an effective way to embed long-range spatial and temporal correlation semantics.

research
12/27/2020

Learning Generalized Spatial-Temporal Deep Feature Representation for No-Reference Video Quality Assessment

In this work, we propose a no-reference video quality assessment method,...
research
08/16/2022

Temporal Action Localization with Multi-temporal Scales

Temporal action localization plays an important role in video analysis, ...
research
01/11/2022

TSA-Net: Tube Self-Attention Network for Action Quality Assessment

In recent years, assessing action quality from videos has attracted grow...
research
12/04/2020

A high performance approach to detecting small targets in long range low quality infrared videos

Since targets are small in long range infrared (IR) videos, it is challe...
research
10/12/2021

Video Is Graph: Structured Graph Module for Video Action Recognition

In the field of action recognition, video clips are always treated as or...
research
08/30/2020

Finding Action Tubes with a Sparse-to-Dense Framework

The task of spatial-temporal action detection has attracted increasing a...
research
01/23/2014

Efficient Background Modeling Based on Sparse Representation and Outlier Iterative Removal

Background modeling is a critical component for various vision-based app...
