Compositional Scene Representation Learning via Reconstruction: A Survey

02/15/2022
by   Jinyang Yuan, et al.
0

Visual scene representation learning is an important research problem in the field of computer vision. The performance on vision tasks could be improved if more suitable representations are learned for visual scenes. Complex visual scenes are the composition of relatively simple visual concepts, and have the property of combinatorial explosion. Compared with directly representing the entire visual scene, extracting compositional scene representations can better cope with the diverse combination of background and objects. Because compositional scene representations abstract the concept of objects, performing visual scene analysis and understanding based on these representations could be easier and more interpretable. Moreover, learning compositional scene representations via reconstruction can greatly reduce the need for training data annotations. Therefore, compositional scene representation learning via reconstruction has important research significance. In this survey, we first discuss representative methods that either learn from a single viewpoint or multiple viewpoints without object-level supervision, then the applications of compositional scene representations, and finally the future directions on this topic.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/07/2021

Unsupervised Learning of Compositional Scene Representations from Multiple Unspecified Viewpoints

Visual scenes are extremely rich in diversity, not only because there ar...
research
07/09/2022

Learning Structured Representations of Visual Scenes

As the intermediate-level representations bridging the two levels, struc...
research
06/13/2022

Compositional Mixture Representations for Vision and Text

Learning a common representation space between vision and language allow...
research
04/22/2019

Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids

Abstracting complex 3D shapes with parsimonious part-based representatio...
research
03/23/2023

Learning and generalization of compositional representations of visual scenes

Complex visual scenes that are composed of multiple objects, each with a...
research
03/19/2021

Knowledge-Guided Object Discovery with Acquired Deep Impressions

We present a framework called Acquired Deep Impressions (ADI) which cont...
research
09/15/2022

Compositional Law Parsing with Latent Random Functions

Human cognition has compositionality. We understand a scene by decomposi...

Please sign up or login with your details

Forgot password? Click here to reset