PKU-MMD: A Large Scale Benchmark for Continuous Multi-Modal Human Action Understanding

03/22/2017
by   Chunhui Liu, et al.
0

Despite the fact that many 3D human activity benchmarks being proposed, most existing action datasets focus on the action recognition tasks for the segmented videos. There is a lack of standard large-scale benchmarks, especially for current popular data-hungry deep learning based methods. In this paper, we introduce a new large scale benchmark (PKU-MMD) for continuous multi-modality 3D human action understanding and cover a wide range of complex human activities with well annotated information. PKU-MMD contains 1076 long video sequences in 51 action categories, performed by 66 subjects in three camera views. It contains almost 20,000 action instances and 5.4 million frames in total. Our dataset also provides multi-modality data sources, including RGB, depth, Infrared Radiation and Skeleton. With different modalities, we conduct extensive experiments on our dataset in terms of two scenarios and evaluate different methods by various metrics, including a new proposed evaluation protocol 2D-AP. We believe this large-scale dataset will benefit future researches on action detection for the community.

READ FULL TEXT

page 1

page 8

research
05/12/2019

NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding

Research on depth-based human activity analysis achieved outstanding per...
research
04/02/2021

UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles

Human behavior understanding with unmanned aerial vehicles (UAVs) is of ...
research
10/22/2018

VIENA2: A Driving Anticipation Dataset

Action anticipation is critical in scenarios where one needs to react be...
research
04/11/2016

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

Recent approaches in depth-based human activity analysis achieved outsta...
research
03/04/2020

ETRI-Activity3D: A Large-Scale RGB-D Dataset for Robots to Recognize Daily Activities of the Elderly

Deep learning, based on which many modern algorithms operate, is well kn...
research
03/03/2014

Multiview Hessian regularized logistic regression for action recognition

With the rapid development of social media sharing, people often need to...
research
12/30/2022

X-MAS: Extremely Large-Scale Multi-Modal Sensor Dataset for Outdoor Surveillance in Real Environments

In robotics and computer vision communities, extensive studies have been...

Please sign up or login with your details

Forgot password? Click here to reset