Accuracy and Performance Comparison of Video Action Recognition Approaches

08/20/2020
by   Matthew Hutchinson, et al.
6

Over the past few years, there has been significant interest in video action recognition systems and models. However, direct comparison of accuracy and computational performance results remain clouded by differing training environments, hardware specifications, hyperparameters, pipelines, and inference methods. This article provides a direct comparison between fourteen off-the-shelf and state-of-the-art models by ensuring consistency in these training characteristics in order to provide readers with a meaningful comparison across different types of video action recognition algorithms. Accuracy of the models is evaluated using standard Top-1 and Top-5 accuracy metrics in addition to a proposed new accuracy metric. Additionally, we compare computational performance of distributed training from two to sixty-four GPUs on a state-of-the-art HPC system.

READ FULL TEXT
research
01/28/2015

Feature Sampling Strategies for Action Recognition

Although dense local spatial-temporal features with bag-of-features repr...
research
11/30/2020

Just One Moment: Inconspicuous One Frame Attack on Deep Action Recognition

The video-based action recognition task has been extensively studied in ...
research
03/21/2018

T-RECS: Training for Rate-Invariant Embeddings by Controlling Speed for Action Recognition

An action should remain identifiable when modifying its speed: consider ...
research
05/29/2023

FMM-X3D: FPGA-based modeling and mapping of X3D for Human Action Recognition

3D Convolutional Neural Networks are gaining increasing attention from r...
research
12/01/2020

A New Action Recognition Framework for Video Highlights Summarization in Sporting Events

To date, machine learning for human action recognition in video has been...
research
05/26/2021

Anticipating human actions by correlating past with the future with Jaccard similarity measures

We propose a framework for early action recognition and anticipation by ...
research
01/26/2019

DistInit: Learning Video Representations without a Single Labeled Video

Video recognition models have progressed significantly over the past few...

Please sign up or login with your details

Forgot password? Click here to reset