Zero-Episode Few-Shot Contrastive Predictive Coding: Solving intelligence tests without prior training

05/04/2022
by   T. Barak, et al.
0

Video prediction models often combine three components: an encoder from pixel space to a small latent space, a latent space prediction model, and a generative model back to pixel space. However, the large and unpredictable pixel space makes training such models difficult, requiring many training examples. We argue that finding a predictive latent variable and using it to evaluate the consistency of a future image enables data-efficient predictions because it precludes the necessity of a generative model training. To demonstrate it, we created sequence completion intelligence tests in which the task is to identify a predictably changing feature in a sequence of images and use this prediction to select the subsequent image. We show that a one-dimensional Markov Contrastive Predictive Coding (M-CPC_1D) model solves these tests efficiently, with only five examples. Finally, we demonstrate the usefulness of M-CPC_1D in solving two tasks without prior training: anomaly detection and stochastic movement video prediction.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 7

page 9

research
05/24/2022

Naive Few-Shot Learning: Sequence Consistency Evaluation

Cognitive psychologists often use the term fluid intelligence to describ...
research
07/10/2018

Representation Learning with Contrastive Predictive Coding

While supervised learning has enabled great progress in many application...
research
11/14/2017

Prediction Under Uncertainty with Error-Encoding Networks

In this work we introduce a new framework for performing temporal predic...
research
04/24/2021

Aligned Contrastive Predictive Coding

We investigate the possibility of forcing a self-supervised model traine...
research
08/19/2022

Wildfire Forecasting with Satellite Images and Deep Generative Model

Wildfire forecasting has been one of the most critical tasks that humani...
research
07/16/2021

Towards an Interpretable Latent Space in Structured Models for Video Prediction

We focus on the task of future frame prediction in video governed by und...
research
10/10/2021

Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding

Recently, phonetic posteriorgrams (PPGs) based methods have been quite p...

Please sign up or login with your details

Forgot password? Click here to reset