Slot Contrastive Networks: A Contrastive Approach for Representing Objects

07/18/2020
by Evan Racah, et al.

Unsupervised extraction of objects from low-level visual data is an important goal for further progress in machine learning. Existing approaches for representing objects without labels use structured generative models trained on static images. These methods spend much of their capacity reconstructing unimportant background pixels and, as a result, miss low-contrast or small objects. In contrast, we present a method that avoids losses in pixel space and does not over-rely on the limited signal a static image provides. Our approach takes advantage of objects' motion by training with a discriminative, time-contrastive loss in the space of slot representations, which encourages each slot not only to capture entities that move but also to capture objects distinct from those in the other slots. Moreover, we introduce a new quantitative evaluation metric that measures how "diverse" a set of slot vectors is, and we use it to evaluate our model on 20 Atari games.
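
To make the idea of a contrastive loss in slot space more concrete, the following is a minimal sketch, not the authors' implementation. It assumes an InfoNCE-style objective in which slot k at time t should match slot k at time t+1 of the same clip, with the other slots at t+1 serving as negatives. The function name `slot_time_contrastive_loss`, the `temperature` parameter, and the array shapes are all hypothetical choices made for illustration; the paper's actual loss may be constructed differently.

```python
import numpy as np


def slot_time_contrastive_loss(slots_t, slots_t1, temperature=0.1):
    """Illustrative InfoNCE-style loss over slot vectors (an assumption,
    not the paper's exact objective).

    slots_t, slots_t1: arrays of shape (batch, num_slots, dim) holding the
    slot vectors of two consecutive frames.
    """
    # L2-normalize so that dot products are cosine similarities.
    a = slots_t / np.linalg.norm(slots_t, axis=-1, keepdims=True)
    b = slots_t1 / np.linalg.norm(slots_t1, axis=-1, keepdims=True)

    # logits[n, i, j]: similarity of slot i at time t to slot j at time t+1.
    logits = np.einsum("nid,njd->nij", a, b) / temperature

    # Numerically stable log-softmax over the candidate slots j.
    logits = logits - logits.max(axis=-1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=-1, keepdims=True))

    # The positive pair for slot i is slot i at the next time step (i == j).
    positives = np.diagonal(log_probs, axis1=1, axis2=2)  # (batch, num_slots)
    return -positives.mean()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    slots = rng.normal(size=(4, 3, 8))
    # Temporally consistent slots should score better than swapped slots.
    print("aligned: ", slot_time_contrastive_loss(slots, slots))
    print("shuffled:", slot_time_contrastive_loss(slots, slots[:, ::-1, :]))
```

Under these assumptions, the loss is low only when each slot stays similar to itself over time and dissimilar to the other slots, which mirrors the two pressures the abstract describes: capturing entities that move and keeping slots distinct from one another.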

