Disentangling What and Where for 3D Object-Centric Representations Through Active Inference

08/26/2021
by   Toon Van de Maele, et al.
0

Although modern object detection and classification models achieve high accuracy, these are typically constrained in advance on a fixed train set and are therefore not flexible to deal with novel, unseen object categories. Moreover, these models most often operate on a single frame, which may yield incorrect classifications in case of ambiguous viewpoints. In this paper, we propose an active inference agent that actively gathers evidence for object classifications, and can learn novel object categories over time. Drawing inspiration from the human brain, we build object-centric generative models composed of two information streams, a what- and a where-stream. The what-stream predicts whether the observed object belongs to a specific category, while the where-stream is responsible for representing the object in its internal 3D reference frame. We show that our agent (i) is able to learn representations for many object categories in an unsupervised way, (ii) achieves state-of-the-art classification accuracies, actively resolving ambiguity when required and (iii) identifies novel object categories. Furthermore, we validate our system in an end-to-end fashion where the agent is able to search for an object at a given pose from a pixel-based rendering. We believe that this is a first step towards building modular, intelligent systems that can be used for a wide range of tasks involving three dimensional objects.

READ FULL TEXT

page 5

page 6

page 16

research
02/07/2023

Object-Centric Scene Representations using Active Inference

Representing a scene and its constituent objects from raw sensory data i...
research
10/11/2017

Detect to Track and Track to Detect

Recent approaches for high accuracy detection and tracking of object cat...
research
03/28/2023

CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects

We present CARTO, a novel approach for reconstructing multiple articulat...
research
01/31/2015

Category-Epitomes : Discriminatively Minimalist Representations for Object Categories

Freehand line sketches are an interesting and unique form of visual repr...
research
09/16/2022

Disentangling Shape and Pose for Object-Centric Deep Active Inference Models

Active inference is a first principles approach for understanding the br...
research
04/14/2023

Symmetry and Complexity in Object-Centric Deep Active Inference Models

Humans perceive and interact with hundreds of objects every day. In doin...
research
04/26/2022

Stochastic Coherence Over Attention Trajectory For Continuous Learning In Video Streams

Devising intelligent agents able to live in an environment and learn by ...

Please sign up or login with your details

Forgot password? Click here to reset