VIDI: A Video Dataset of Incidents

05/26/2022
by Duygu Sesver, et al.

Automatic detection of natural disasters and incidents has become increasingly important as a tool for fast response. Many studies detect incidents using still images and text, but few approaches exploit temporal information. One of the main reasons is the lack of a diverse video dataset covering various incident types. To address this need, we present the Video Dataset of Incidents (VIDI), which contains 4,534 video clips corresponding to 43 incident categories. Each incident class has around 100 videos, with an average duration of ten seconds. To increase diversity, the videos were searched for in several languages. To assess the performance of recent state-of-the-art approaches, Vision Transformer and TimeSformer, and to explore the contribution of video-based information to incident classification, we performed benchmark experiments on VIDI and the Incidents Dataset. The results show that these recent methods improve incident classification accuracy and that employing video data is very beneficial for the task: using video data raises the top-1 accuracy to 76.56% from the accuracy obtained using a single frame. VIDI will be made publicly available. Additional materials can be found at the following link: https://github.com/vididataset/VIDI.


