Identifying and Extracting Football Features from Real-World Media Sources using Only Synthetic Training Data

09/27/2022
by   Jose Cerqueira Fernandes, et al.
0

Real-world images used for training machine learning algorithms are often unstructured and inconsistent. The process of analysing and tagging these images can be costly and error prone (also availability, gaps and legal conundrums). However, as we demonstrate in this article, the potential to generate accurate graphical images that are indistinguishable from real-world sources has a multitude of benefits in machine learning paradigms. One such example of this is football data from broadcast services (television and other streaming media sources). The football games are usually recorded from multiple sources (cameras and phones) and resolutions, not to mention, occlusion of visual details and other artefacts (like blurring, weathering and lighting conditions) which make it difficult to accurately identify features. We demonstrate an approach which is able to overcome these limitations using generated tagged and structured images. The generated images are able to simulate a variety views and conditions (including noise and blurring) which may only occur sporadically in real-world data and make it difficult for machine learning algorithm to 'cope' with these unforeseen problems in real-data. This approach enables us to rapidly train and prepare a robust solution that accurately extracts features (e.g., spacial locations, markers on the pitch, player positions, ball location and camera FOV) from real-world football match sources for analytical purposes.

READ FULL TEXT

page 1

page 3

page 4

research
10/06/2016

Driving in the Matrix: Can Virtual Worlds Replace Human-Generated Annotations for Real World Tasks?

Deep learning has rapidly transformed the state of the art algorithms us...
research
05/01/2023

A novel algorithm can generate data to train machine learning models in conditions of extreme scarcity of real world data

Training machine learning models requires large datasets. However, colle...
research
10/11/2017

GeneSIS-RT: Generating Synthetic Images for training Secondary Real-world Tasks

We propose a novel approach for generating high-quality, synthetic data ...
research
11/30/2021

RADU: Ray-Aligned Depth Update Convolutions for ToF Data Denoising

Time-of-Flight (ToF) cameras are subject to high levels of noise and dis...
research
04/05/2019

Bayesian Heatmaps: Probabilistic Classification with Multiple Unreliable Information Sources

Unstructured data from diverse sources, such as social media and aerial ...
research
06/22/2022

Deep Learning to Jointly Schema Match, Impute, and Transform Databases

An applied problem facing all areas of data science is harmonizing data ...
research
07/21/2023

Using simulation to calibrate real data acquisition in veterinary medicine

This paper explores the innovative use of simulation environments to enh...

Please sign up or login with your details

Forgot password? Click here to reset