Stochastic Coherence Over Attention Trajectory For Continuous Learning In Video Streams

04/26/2022
by   Matteo Tiezzi, et al.

Devising intelligent agents able to live in an environment and learn by observing their surroundings is a longstanding goal of Artificial Intelligence. From a pure Machine Learning perspective, challenges arise when the agent cannot leverage large, fully annotated datasets and its interactions with supervisory signals are instead sparsely distributed over space and time. This paper proposes a novel neural-network-based approach to progressively and autonomously develop pixel-wise representations in a video stream. The proposed method is based on a human-like attention mechanism that allows the agent to learn by observing what is moving in the attended locations. Spatio-temporal stochastic coherence along the attention trajectory, paired with a contrastive term, leads to an unsupervised learning criterion that naturally copes with the considered setting. Unlike most existing works, the learned representations are used in open-set class-incremental classification of each frame pixel, relying on few supervisions. Our experiments leverage 3D virtual environments and show that the proposed agents can learn to distinguish objects just by observing the video stream. Inheriting features from state-of-the-art models is not as powerful as one might expect.
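The abstract does not state the learning criterion itself, so the following is only a minimal sketch of the general idea of combining a coherence term along the attention trajectory with a contrastive term. Everything in it is an assumption made for illustration: the function names `cosine` and `coherence_contrastive_loss`, the use of cosine similarity, the choice of the latest attended feature as the contrastive anchor, and the `contrast_weight` parameter are hypothetical and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def coherence_contrastive_loss(trajectory_feats, other_feats, contrast_weight=1.0):
    """Illustrative loss (an assumption, not the paper's formula).

    trajectory_feats: pixel features at the attended location in consecutive
        frames (at least two vectors).
    other_feats: pixel features sampled at other locations (at least one).
    """
    # Coherence term: features at consecutive points of the attention
    # trajectory are encouraged to agree (cosine similarity close to 1).
    pairs = list(zip(trajectory_feats, trajectory_feats[1:]))
    coherence = sum(1.0 - cosine(a, b) for a, b in pairs) / len(pairs)

    # Contrastive term: the latest attended feature is discouraged from
    # aligning with features sampled elsewhere (positive similarity is penalized).
    anchor = trajectory_feats[-1]
    contrast = sum(max(0.0, cosine(anchor, f)) for f in other_feats) / len(other_feats)

    return coherence + contrast_weight * contrast


if __name__ == "__main__":
    stable = [[1.0, 0.0], [1.0, 0.0]]    # same feature across two frames
    drifting = [[1.0, 0.0], [0.0, 1.0]]  # feature changes along the trajectory
    elsewhere = [[0.0, 1.0]]             # feature at a non-attended location
    print(coherence_contrastive_loss(stable, elsewhere))
    print(coherence_contrastive_loss(drifting, elsewhere))
```

In this toy setting, a trajectory whose features stay constant and differ from those elsewhere incurs a lower loss than one whose features drift toward those of other locations. How the paper samples locations and weighs the two terms is not specified in the abstract.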


