Cricket stroke extraction: Towards creation of a large-scale cricket actions dataset

01/10/2019
by Arpan Gupta, et al.

In this paper, we address temporal action localization in a large-scale dataset of untrimmed cricket videos. Our action of interest is a cricket stroke played by a batsman, which is usually covered by cameras placed in the stands at both ends of the cricket pitch. After a sequence of preprocessing steps, the dataset contained 73 million frames across 1110 videos at a constant frame rate and resolution. The localization method is a general one: it applies a trained random forest model for cut detection (using summed grayscale histogram difference features) and two linear SVM camera models (CAM1 and CAM2) for first-frame detection, trained on HOG features of CAM1 and CAM2 video shots. CAM1 and CAM2 shots are assumed to be part of the cricket stroke. At the predicted boundary positions, the HOG features of the first frames were computed, and a simple algorithm was used to combine the positively predicted camera shots. To keep the process as generic as possible, we did not use any domain-specific knowledge, such as tracking or specific shape and motion features. We provide a detailed analysis of our methodology, along with the metrics used to evaluate the individual models and the final predicted segments. We achieved a weighted mean TIoU of 0.5097 over a small sample of the test set.
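To make the reported metric concrete, here is a minimal sketch of temporal intersection over union (TIoU) between a predicted and a ground-truth segment, followed by a weighted mean. The abstract does not say how the segments are encoded or what the weights are, so the half-open `(start, end)` frame intervals, the function names, and the caller-supplied weights are all assumptions made for illustration, not the authors' implementation.

```python
def tiou(pred, truth):
    """Temporal IoU of two segments given as (start, end) frame indices.

    Assumption: intervals are half-open, i.e. the end frame is excluded.
    """
    (p_start, p_end), (t_start, t_end) = pred, truth
    # Overlap is zero when the segments do not intersect.
    intersection = max(0, min(p_end, t_end) - max(p_start, t_start))
    union = (p_end - p_start) + (t_end - t_start) - intersection
    return intersection / union if union > 0 else 0.0


def weighted_mean_tiou(scores, weights):
    """Weighted mean of per-video (or per-segment) TIoU scores.

    Assumption: the weights are supplied by the caller; the abstract does
    not state which weighting scheme the paper uses.
    """
    total = sum(weights)
    if total == 0:
        return 0.0
    return sum(s * w for s, w in zip(scores, weights)) / total


if __name__ == "__main__":
    # Hypothetical segments, in frames.
    print(tiou((0, 10), (5, 15)))
    print(weighted_mean_tiou([0.5, 1.0], [1, 3]))
```

A TIoU of 1.0 means the predicted stroke boundaries match the annotation exactly, and 0.0 means no temporal overlap.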

research
04/30/2022

RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems

Interactive autonomous applications require robustness of the perception...
research
04/26/2018

Deep Keyframe Detection in Human Action Videos

Detecting representative frames in videos based on human actions is quit...
research
03/15/2020

SF-Net: Single-Frame Supervision for Temporal Action Localization

In this paper, we study an intermediate form of supervision, i.e., singl...
research
12/26/2017

SLAC: A Sparsely Labeled Dataset for Action Classification and Localization

This paper describes a procedure for the creation of large-scale video d...
research
05/15/2019

Synthetic Defocus and Look-Ahead Autofocus for Casual Videography

In cinema, large camera lenses create beautiful shallow depth of field (...
research
04/22/2021

Localization of Ice-Rink for Broadcast Hockey Videos

In this work, an automatic and simple framework for hockey ice-rink loca...
research
12/03/2020

Motion-based Camera Localization System in Colonoscopy Videos

Optical colonoscopy is an essential diagnostic and prognostic tool for m...
