Video Pixel Networks

by   Nal Kalchbrenner, et al.

We propose a probabilistic video model, the Video Pixel Network (VPN), that estimates the discrete joint distribution of the raw pixel values in a video. The model and the neural architecture reflect the time, space and color structure of video tensors and encode it as a four-dimensional dependency chain. The VPN approaches the best possible performance on the Moving MNIST benchmark, a leap over the previous state of the art, and the generated videos show only minor deviations from the ground truth. The VPN also produces detailed samples on the action-conditional Robotic Pushing benchmark and generalizes to the motion of novel objects.


page 11

page 12

page 13

page 14

page 15

page 16


Attentioned Convolutional LSTM InpaintingNetwork for Anomaly Detection in Videos

We propose a semi-supervised model for detecting anomalies in videos ins...

PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation

We address semi-supervised video object segmentation, the task of automa...

Box Supervised Video Segmentation Proposal Network

Video Object Segmentation (VOS) has been targeted by various fully-super...

Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation

Automatic surgical scene segmentation is fundamental for facilitating co...

Application Of ADNN For Background Subtraction In Smart Surveillance System

Object movement identification is one of the most researched problems in...

Training Adversarial Discriminators for Cross-channel Abnormal Event Detection in Crowds

Abnormal crowd behaviour detection attracts a large interest due to its ...

Extended probabilistic Rand index and the adjustable moving window-based pixel-pair sampling method

The probabilistic Rand (PR) index has the following three problems: It l...

Please sign up or login with your details

Forgot password? Click here to reset