Language-Mediated, Object-Centric Representation Learning

12/31/2020
by   Ruocheng Wang, et al.
12

We present Language-mediated, Object-centric Representation Learning (LORL), a paradigm for learning disentangled, object-centric scene representations from vision and language. LORL builds upon recent advances in unsupervised object segmentation, notably MONet and Slot Attention. While these algorithms learn an object-centric representation just by reconstructing the input image, LORL enables them to further learn to associate the learned representations to concepts, i.e., words for object categories, properties, and spatial relationships, from language input. These object-centric concepts derived from language facilitate the learning of object-centric representations. LORL can be integrated with various unsupervised segmentation algorithms that are language-agnostic. Experiments show that the integration of LORL consistently improves the object segmentation performance of MONet and Slot Attention on two datasets via the help of language. We also show that concepts learned by LORL, in conjunction with segmentation algorithms such as MONet, aid downstream tasks such as referring expression comprehension.

READ FULL TEXT

page 1

page 4

page 6

page 7

page 11

page 12

research
07/18/2023

Unsupervised Conditional Slot Attention for Object Centric Learning

Extracting object-level representations for downstream reasoning tasks i...
research
12/20/2022

Towards Unsupervised Visual Reasoning: Do Off-The-Shelf Features Know How to Reason?

Recent advances in visual representation learning allowed to build an ab...
research
01/11/2021

Evaluating Disentanglement of Structured Latent Representations

We design the first multi-layer disentanglement metric operating at all ...
research
05/31/2023

Spotlight Attention: Robust Object-Centric Learning With a Spatial Locality Prior

The aim of object-centric vision is to construct an explicit representat...
research
02/16/2023

Object-centric Learning with Cyclic Walks between Parts and Whole

Learning object-centric representations from complex natural environment...
research
07/15/2022

Sparse Relational Reasoning with Object-Centric Representations

We investigate the composability of soft-rules learned by relational neu...
research
06/03/2023

Cycle Consistency Driven Object Discovery

Developing deep learning models that effectively learn object-centric re...

Please sign up or login with your details

Forgot password? Click here to reset