Semi-supervised and Deep learning Frameworks for Video Classification and Key-frame Identification

03/25/2022
by   Sohini Roychowdhury, et al.
0

Automating video-based data and machine learning pipelines poses several challenges including metadata generation for efficient storage and retrieval and isolation of key-frames for scene understanding tasks. In this work, we present two semi-supervised approaches that automate this process of manual frame sifting in video streams by automatically classifying scenes for content and filtering frames for fine-tuning scene understanding tasks. The first rule-based method starts from a pre-trained object detector and it assigns scene type, uncertainty and lighting categories to each frame based on probability distributions of foreground objects. Next, frames with the highest uncertainty and structural dissimilarity are isolated as key-frames. The second method relies on the simCLR model for frame encoding followed by label-spreading from 20 scene and lighting categories. Also, clustering the video frames in the encoded feature space further isolates key-frames at cluster boundaries. The proposed methods achieve 64-93 image videos from public domain datasets of JAAD and KITTI. Also, less than 10 of all input frames can be filtered as key-frames that can then be sent for annotation and fine tuning of machine vision algorithms. Thus, the proposed framework can be scaled to additional video data streams for automated training of perception-driven systems with minimal training images.

READ FULL TEXT

page 1

page 3

page 4

page 8

page 9

research
05/28/2016

Video Key Frame Extraction using Entropy value as Global and Local Feature

Key frames play an important role in video annotation. It is one of the ...
research
03/28/2019

BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames

Semi-supervised video object segmentation has made significant progress ...
research
01/09/2023

Cursive Caption Text Detection in Videos

Textual content appearing in videos represents an interesting index for ...
research
04/06/2021

A New Dimension in Testimony: Relighting Video with Reflectance Field Exemplars

We present a learning-based method for estimating 4D reflectance field o...
research
09/16/2023

FrameRS: A Video Frame Compression Model Composed by Self supervised Video Frame Reconstructor and Key Frame Selector

In this paper, we present frame reconstruction model: FrameRS. It consis...
research
01/29/2023

Maximal Cliques on Multi-Frame Proposal Graph for Unsupervised Video Object Segmentation

Unsupervised Video Object Segmentation (UVOS) aims at discovering object...
research
08/03/2017

Unsupervised Video Understanding by Reconciliation of Posture Similarities

Understanding human activity and being able to explain it in detail surp...

Please sign up or login with your details

Forgot password? Click here to reset