Unsupervised Image Decomposition with Phase-Correlation Networks

10/07/2021
by   Angel Villar-Corrales, et al.
17

The ability to decompose scenes into their object components is a desired property for autonomous agents, allowing them to reason and act in their surroundings. Recently, different methods have been proposed to learn object-centric representations from data in an unsupervised manner. These methods often rely on latent representations learned by deep neural networks, hence requiring high computational costs and large amounts of curated data. Such models are also difficult to interpret. To address these challenges, we propose the Phase-Correlation Decomposition Network (PCDNet), a novel model that decomposes a scene into its object components, which are represented as transformed versions of a set of learned object prototypes. The core building block in PCDNet is the Phase-Correlation Cell (PC Cell), which exploits the frequency-domain representation of the images in order to estimate the transformation between an object prototype and its transformed version in the image. In our experiments, we show how PCDNet outperforms state-of-the-art methods for unsupervised object discovery and segmentation on simple benchmark datasets and on more challenging data, while using a small number of learnable parameters and being fully interpretable.

READ FULL TEXT

page 2

page 9

page 11

page 12

page 19

page 20

page 21

page 22

research
07/16/2021

Unsupervised Discovery of Object Radiance Fields

We study the problem of inferring an object-centric scene representation...
research
06/12/2020

Unmasking the Inductive Biases of Unsupervised Object Representations for Video Sequences

Perceiving the world in terms of objects is a crucial prerequisite for r...
research
09/29/2022

Bridging the Gap to Real-World Object-Centric Learning

Humans naturally decompose their environment into entities at the approp...
research
04/29/2021

Unsupervised Layered Image Decomposition into Object Prototypes

We present an unsupervised learning framework for decomposing images int...
research
04/05/2022

Complex-Valued Autoencoders for Object Discovery

Object-centric representations form the basis of human perception and en...
research
01/22/2019

MONet: Unsupervised Scene Decomposition and Representation

The ability to decompose scenes in terms of abstract building blocks is ...
research
05/24/2023

Contrastive Training of Complex-Valued Autoencoders for Object Discovery

Current state-of-the-art object-centric models use slots and attention-b...

Please sign up or login with your details

Forgot password? Click here to reset