A Naturalness Evaluation Database for Video Prediction Models

05/01/2020
by Nagabhushan Somraj, et al.

The study of video prediction models is believed to be a fundamental approach to representation learning for videos. While a plethora of generative models exist for predicting future frame pixel values given the past few frames, quantitative evaluation of the predicted frames has been found to be extremely challenging. In this context, we introduce the problem of naturalness evaluation, which refers to how natural or realistic a predicted video looks. We create the Indian Institute of Science Video Naturalness Evaluation (IISc VINE) Database, consisting of 300 videos obtained by applying different prediction models to different datasets, together with accompanying human opinion scores. Fifty human subjects participated in our study, yielding around 6000 human ratings of naturalness. Our subjective study reveals that human observers show highly consistent judgement of naturalness. We benchmark several popularly used measures for evaluating video prediction and show that they do not adequately correlate with the subjective scores. We introduce two new features to help capture naturalness effectively. In particular, we show that motion-compensated cosine similarities of deep features of predicted frames with past frames, and deep features extracted from rescaled frame differences, lead to state-of-the-art naturalness prediction in accordance with human judgements. The database and code will be made publicly available at our project website: https://sites.google.com/site/nagabhushansn95/publications/vine.
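The three ingredients named in the abstract (cosine similarity between feature vectors, rescaled frame differences, and correlation of an objective measure with subjective scores) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names are hypothetical, motion compensation and the deep feature extractor are omitted, min-max rescaling is only an assumed form of "rescaling", and Spearman rank correlation is an assumed choice of correlation measure.

```python
import numpy as np


def cosine_similarity(a, b, eps=1e-12):
    """Cosine similarity between two feature vectors (flattened)."""
    a = np.asarray(a, dtype=float).ravel()
    b = np.asarray(b, dtype=float).ravel()
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + eps))


def rescaled_frame_difference(prev_frame, next_frame):
    """Difference of two frames, min-max rescaled to [0, 1].

    Assumption: the abstract does not say how differences are rescaled;
    min-max scaling is used here purely for illustration.
    """
    diff = np.asarray(next_frame, dtype=float) - np.asarray(prev_frame, dtype=float)
    lo, hi = diff.min(), diff.max()
    if hi == lo:
        # Identical frames (or a constant offset): no structure to rescale.
        return np.zeros_like(diff)
    return (diff - lo) / (hi - lo)


def _average_ranks(values):
    """Ranks starting at 0, with tied values sharing their average rank."""
    values = np.asarray(values, dtype=float)
    order = np.argsort(values, kind="stable")
    ranks = np.empty(len(values), dtype=float)
    ranks[order] = np.arange(len(values), dtype=float)
    for v in np.unique(values):
        mask = values == v
        ranks[mask] = ranks[mask].mean()
    return ranks


def spearman_correlation(objective_scores, subjective_scores):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    rx = _average_ranks(objective_scores)
    ry = _average_ranks(subjective_scores)
    return float(np.corrcoef(rx, ry)[0, 1])


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical stand-ins for deep features of a past and a predicted frame.
    past_features = rng.normal(size=128)
    predicted_features = past_features + 0.1 * rng.normal(size=128)
    print("feature similarity:", cosine_similarity(past_features, predicted_features))

    # Hypothetical per-video objective scores and mean opinion scores.
    objective = [0.91, 0.40, 0.75, 0.62]
    opinion = [4.5, 1.8, 3.9, 3.1]
    print("rank correlation:", spearman_correlation(objective, opinion))
```

A measure that tracks human judgement well would give a rank correlation close to 1 on such data; the abstract's claim is that commonly used video prediction measures fall short of this on the IISc VINE ratings.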


