Scanner: Efficient Video Analysis at Scale

05/18/2018
by   Alex Poms, et al.
0

A growing number of visual computing applications depend on the analysis of large video collections. The challenge is that scaling applications to operate on these datasets requires efficient systems for pixel data access and parallel processing across large numbers of machines. Few programmers have the capability to operate efficiently at these scales, limiting the field's ability to explore new applications that leverage big video data. In response, we have created Scanner, a system for productive and efficient video analysis at scale. Scanner organizes video collections as tables in a data store optimized for sampling frames from compressed video, and executes pixel processing computations, expressed as dataflow graphs, on these frames. Scanner schedules video analysis applications expressed using these abstractions onto heterogeneous throughput computing hardware, such as multi-core CPUs, GPUs, and media processing ASICs, for high-throughput pixel processing. We demonstrate the productivity of Scanner by authoring a variety of video processing applications including the synthesis of stereo VR video streams from multi-camera rigs, markerless 3D human pose reconstruction from video, and data-mining big video datasets such as hundreds of feature-length films or over 70,000 hours of TV news. These applications achieve near-expert performance on a single machine and scale efficiently to hundreds of machines, enabling formerly long-running big video data analysis tasks to be carried out in minutes to hours.

READ FULL TEXT

page 2

page 10

research
10/10/2018

V3C - a Research Video Collection

With the widespread use of smartphones as recording devices and the mass...
research
05/26/2018

Large-Scale Learning from Data Streams with Apache SAMOA

Apache SAMOA (Scalable Advanced Massive Online Analysis) is an open-sour...
research
02/28/2015

Macroblock Classification Method for Video Applications Involving Motions

In this paper, a macroblock classification method is proposed for variou...
research
10/01/2019

Training Kinetics in 15 Minutes: Large-scale Distributed Training on Videos

Deep video recognition is more computationally expensive than image reco...
research
02/05/2021

Sampling Based Scene-Space Video Processing

Many compelling video processing effects can be achieved if per-pixel de...
research
11/29/2016

A Large-scale Distributed Video Parsing and Evaluation Platform

Visual surveillance systems have become one of the largest data sources ...
research
09/07/2022

ErgoExplorer: Interactive Ergonomic Risk Assessment from Video Collections

Ergonomic risk assessment is now, due to an increased awareness, carried...

Please sign up or login with your details

Forgot password? Click here to reset