MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection

12/02/2020
by Kellie Corona, et al.

We present the Multiview Extended Video with Activities (MEVA) dataset, a new, very large-scale dataset for human activity recognition. Existing security datasets either focus on activity counts by aggregating public video disseminated due to its content, which typically excludes same-scene background video, or achieve persistence by observing public areas and thus cannot control for activity content. Our dataset comprises over 9,300 hours of untrimmed, continuous video, scripted to include diverse, simultaneous activities along with spontaneous background activity. We have annotated 144 hours for 37 activity types, marking bounding boxes of actors and props. Our collection observed approximately 100 actors performing scripted scenarios and spontaneous background activity over a three-week period at an access-controlled venue, recording in multiple modalities with overlapping and non-overlapping indoor and outdoor viewpoints. The resulting data includes video from 38 RGB and thermal IR cameras, 42 hours of UAV footage, and GPS locations for the actors. Of the annotated video, 122 hours are sequestered in support of the NIST Activity in Extended Video (ActEV) challenge; the other 22 hours of annotation and the corresponding video are available on our website, along with an additional 306 hours of ground camera data, 4.6 hours of UAV data, and 9.6 hours of GPS logs. Additional derived data includes camera models geo-registering the outdoor cameras and a dense 3D point cloud model of the outdoor scene. The data was collected with IRB oversight and approval and is released under a CC-BY-4.0 license.


