DECOMPL: Decompositional Learning with Attention Pooling for Group Activity Recognition from a Single Volleyball Image

03/11/2023
by Berker Demirel, et al.

Group Activity Recognition (GAR) aims to detect the activity performed by multiple actors in a scene. Prior works model spatio-temporal features based on RGB, optical flow, or keypoint data. However, combining temporality with all of these data types increases the computational complexity significantly. Our hypothesis is that by using only RGB data without temporality, performance can be maintained with a negligible loss in accuracy. To that end, we propose DECOMPL, a novel GAR technique for volleyball videos that consists of two complementary branches. The visual branch extracts features selectively using attention pooling. The coordinate branch considers the current configuration of the actors and extracts spatial information from the box coordinates. Moreover, we analyzed the Volleyball dataset, on which most of the recent literature is based, and found that its labeling scheme degrades the group concept in the activities to the level of individual actors. We manually reannotated the dataset in a systematic manner to emphasize the group concept. Experimental results on the Volleyball dataset, as well as on the Collective Activity dataset (from another domain, i.e., not volleyball), demonstrate the effectiveness of DECOMPL: among comparable state-of-the-art techniques, it delivered the best GAR performance with the reannotations and the second-best with the original annotations. Our code, results and new annotations will be made available through GitHub after the revision process.
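To make the two-branch idea concrete, the following is a minimal sketch, not the authors' implementation. It assumes that attention pooling means a softmax-weighted sum of per-actor feature vectors scored against a single query vector, and that the coordinate branch starts from box coordinates normalized by image size. The names `attention_pool`, `normalize_boxes`, `query`, and all the toy values are hypothetical and chosen only for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(actor_features, query):
    """Pool N actor feature vectors into one vector.

    Assumption: each actor is scored by a dot product with a query
    vector (which would be learned in a real model), and the pooled
    feature is the softmax-weighted sum of the actor features.
    """
    scores = [sum(f * q for f, q in zip(feat, query)) for feat in actor_features]
    weights = softmax(scores)
    dim = len(query)
    pooled = [
        sum(w * feat[d] for w, feat in zip(weights, actor_features))
        for d in range(dim)
    ]
    return pooled, weights


def normalize_boxes(boxes, image_width, image_height):
    """Scale (x1, y1, x2, y2) pixel boxes to the [0, 1] range."""
    return [
        [x1 / image_width, y1 / image_height, x2 / image_width, y2 / image_height]
        for x1, y1, x2, y2 in boxes
    ]


# Toy inputs: three actors with 2-D features, two boxes in a 1280x720 frame.
actor_features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [2.0, 0.0]
pooled, weights = attention_pool(actor_features, query)
coords = normalize_boxes([(0, 0, 640, 360), (320, 180, 1280, 720)], 1280, 720)
```

In this toy setup the first and third actors receive equal, larger weights than the second, so the pooled vector is dominated by the actors the query selects. In an actual model, the pooled visual feature and an embedding of the normalized coordinates would each feed a classifier.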


research
02/27/2018

ReHAR: Robust and Efficient Human Activity Recognition

Designing a scheme that can achieve a good performance in predicting sin...
research
08/31/2022

Attentive pooling for Group Activity Recognition

In group activity recognition, hierarchical framework is widely adopted ...
research
12/11/2021

COMPOSER: Compositional Learning of Group Activity in Videos

Group Activity Recognition (GAR) detects the activity performed by a gro...
research
04/23/2019

Learning Actor Relation Graphs for Group Activity Recognition

Modeling relation between actors is important for recognizing group acti...
research
12/18/2018

Multi-Level Sequence GAN for Group Activity Recognition

We propose a novel semi-supervised, Multi-Level Sequential Generative Ad...
research
08/13/2019

Three Branches: Detecting Actions With Richer Features

We present our three branch solutions for International Challenge on Act...
research
05/18/2023

XFormer: Fast and Accurate Monocular 3D Body Capture

We present XFormer, a novel human mesh and motion capture method that ac...
