SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos

04/12/2018
by   Silvio Giancola, et al.
0

In this paper, we introduce SoccerNet, a benchmark for action spotting in soccer videos. The dataset is composed of 500 complete soccer games from six main European leagues, covering three seasons from 2014 to 2017 and a total duration of 764 hours. A total of 6,637 temporal annotations are automatically parsed from online match reports at a one minute resolution for three main classes of events (Goal, Yellow/Red Card, and Substitution). As such, the dataset is easily scalable. These annotations are manually refined to a one second resolution by anchoring them at a single timestamp following well-defined soccer rules. With an average of one event every 6.9 minutes, this dataset focuses on the problem of localizing very sparse events within long videos. We define the task of spotting as finding the anchors of soccer events in a video. Making use of recent developments in the realm of generic action recognition and detection in video, we provide strong baselines for detecting soccer events. We show that our best model for classifying temporal segments of length one minute reaches a mean Average Precision (mAP) of 67.8 spotting task, our baseline reaches an Average-mAP of 49.7 δ ranging from 5 to 60 seconds.

READ FULL TEXT

page 1

page 14

page 15

research
11/09/2020

Improved Soccer Action Spotting using both Audio and Video Streams

In this paper, we propose a study on multi-modal (audio and video) actio...
research
02/15/2021

RMS-Net: Regression and Masking for Soccer Event Spotting

The recently proposed action spotting task consists in finding the exact...
research
02/06/2023

Baseline Method for the Sport Task of MediaEval 2022 with 3D CNNs using Attention Mechanisms

This paper presents the baseline method proposed for the Sports Video ta...
research
12/17/2020

Multi-shot Temporal Event Localization: a Benchmark

Current developments in temporal event or action localization usually ta...
research
05/10/2012

Hajj and Umrah Event Recognition Datasets

In this note, new Hajj and Umrah Event Recognition datasets (HUER) are p...
research
10/04/2018

Towards High Resolution Video Generation with Progressive Growing of Sliced Wasserstein GANs

The extension of image generation to video generation turns out to be a ...
research
03/29/2023

A Video-based End-to-end Pipeline for Non-nutritive Sucking Action Recognition and Segmentation in Young Infants

We present an end-to-end computer vision pipeline to detect non-nutritiv...

Please sign up or login with your details

Forgot password? Click here to reset