Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos

04/25/2018
by   Gunnar A. Sigurdsson, et al.
0

In Actor and Observer we introduced a dataset linking the first and third-person video understanding domains, the Charades-Ego Dataset. In this paper we describe the egocentric aspect of the dataset and present annotations for Charades-Ego with 68,536 activity instances in 68.8 hours of first and third-person video, making it one of the largest and most diverse egocentric datasets available. Charades-Ego furthermore shares activity classes, scripts, and methodology with the Charades dataset, that consist of additional 82.3 hours of third-person video with 66,500 activity instances. Charades-Ego has temporal annotations and textual descriptions, making it suitable for egocentric video classification, localization, captioning, and new tasks utilizing the cross-modal nature of the data.

READ FULL TEXT

page 1

page 2

research
06/03/2021

APES: Audiovisual Person Search in Untrimmed Video

Humans are arguably one of the most important subjects in video streams,...
research
12/02/2020

MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection

We present the Multiview Extended Video with Activities (MEVA) dataset, ...
research
04/25/2018

Actor and Observer: Joint Modeling of First and Third-Person Videos

Several theories in cognitive neuroscience suggest that when people inte...
research
07/14/2023

TVPR: Text-to-Video Person Retrieval and a New Benchmark

Most existing methods for text-based person retrieval focus on text-to-i...
research
06/01/2023

MammalNet: A Large-scale Video Benchmark for Mammal Recognition and Behavior Understanding

Monitoring animal behavior can facilitate conservation efforts by provid...
research
06/11/2020

Privacy-Aware Activity Classification from First Person Office Videos

In the advent of wearable body-cameras, human activity classification fr...
research
08/28/2019

Out the Window: A Crowd-Sourced Dataset for Activity Classification in Surveillance Video

The Out the Window (OTW) dataset is a crowdsourced activity dataset cont...

Please sign up or login with your details

Forgot password? Click here to reset