Exploring the Role of the Bottleneck in Slot-Based Models Through Covariance Regularization

by   Andrew Stange, et al.

In this project we attempt to make slot-based models with an image reconstruction objective competitive with those that use a feature reconstruction objective on real world datasets. We propose a loss-based approach to constricting the bottleneck of slot-based models, allowing larger-capacity encoder networks to be used with Slot Attention without producing degenerate stripe-shaped masks. We find that our proposed method offers an improvement over the baseline Slot Attention model but does not reach the performance of on the COCO2017 dataset. Throughout this project, we confirm the superiority of a feature reconstruction objective over an image reconstruction objective and explore the role of the architectural bottleneck in slot-based models.


page 4

page 6

page 10

page 11

page 12

page 13

page 14


Unsupervised Object-Centric Learning with Bi-Level Optimized Query Slot Attention

The ability to decompose complex natural scenes into meaningful object-c...

PSDet: Efficient and Universal Parking Slot Detection

While real-time parking slot detection plays a critical role in valet pa...

Please sign up or login with your details

Forgot password? Click here to reset