Is end-to-end learning enough for fitness activity recognition?

05/14/2023
by Antoine Mercier, et al.

End-to-end learning has taken hold of many computer vision tasks, particularly those involving still images, where task-specific optimization yields very strong performance. Nevertheless, human-centric action recognition is still largely dominated by hand-crafted pipelines, in which only individual components are replaced by neural networks that typically operate on individual frames. As a testbed to study the relevance of such pipelines, we present a new fully annotated video dataset of fitness activities. Recognition in this domain depends almost exclusively on human poses and their temporal dynamics, so pose-based solutions should perform well. We show that, with this labelled data, end-to-end learning on raw pixels can compete with state-of-the-art action recognition pipelines based on pose estimation. We also show that end-to-end learning can support temporally fine-grained tasks such as real-time repetition counting.


