ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object Stacking

04/21/2018
by   Oliver Groth, et al.

Physical intuition is pivotal for intelligent agents to perform complex tasks. In this paper, we investigate the passive acquisition of an intuitive understanding of physical principles, as well as the active utilisation of this intuition, in the context of generalised object stacking. To this end, we provide ShapeStacks, a simulation-based dataset featuring 20,000 stack configurations composed of a variety of elementary geometric primitives, richly annotated with respect to semantics and structural stability. We train visual classifiers for binary stability prediction on the ShapeStacks data and scrutinise their learned physical intuition. Owing to the richness of the training data, our approach also generalises favourably to real-world scenarios, achieving state-of-the-art stability prediction on a publicly available benchmark of block towers. We then leverage the physical intuition learned by our model to actively construct stable stacks and observe the emergence of an intuitive notion of stackability (an inherent object affordance) induced by the active stacking task. Our approach performs well even in challenging conditions, where it considerably exceeds the stack height observed during training, or in cases where initially unstable structures must be stabilised via counterbalancing.
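To make the notion of a binary stability label concrete, the following is a minimal sketch of a textbook centre-of-mass criterion for a stack of rigid blocks in one horizontal dimension. It is not the paper's method: the abstract states that the annotations come from simulation and that the predictions come from visual classifiers. The names `Block` and `is_stable`, and the simplification to 1D boxes with uniform density, are assumptions made purely for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Block:
    """Hypothetical 1D block: horizontal centre, width and mass."""
    x: float
    width: float
    mass: float


def is_stable(stack: List[Block]) -> bool:
    """Return True if the stack (ordered bottom to top) is statically stable.

    Simplified rule (an assumption, not the paper's simulator): at every
    interface, the combined centre of mass of all blocks above it must lie
    within the contact interval between the two touching blocks.
    """
    for i in range(len(stack) - 1):
        below, above = stack[i], stack[i + 1]
        # Contact interval: horizontal overlap of the two touching blocks.
        lo = max(below.x - below.width / 2, above.x - above.width / 2)
        hi = min(below.x + below.width / 2, above.x + above.width / 2)
        if lo >= hi:
            return False  # the blocks do not touch at all
        upper = stack[i + 1:]
        total_mass = sum(b.mass for b in upper)
        com = sum(b.mass * b.x for b in upper) / total_mass
        if not (lo <= com <= hi):
            return False  # everything above this interface would topple
    return True


# A tower whose blocks each rest on the one below, but which leans too far overall.
leaning = [Block(0.0, 2.0, 1.0), Block(0.8, 2.0, 1.0), Block(1.6, 2.0, 1.0)]

# An overhanging plank that is unstable alone and stable with a heavy counterweight.
overhang = [Block(0.0, 2.0, 1.0), Block(1.2, 4.0, 1.0)]
counterbalanced = overhang + [Block(-0.2, 1.0, 3.0)]
```

Under this simplified rule, `is_stable(leaning)` and `is_stable(overhang)` are both false, while `is_stable(counterbalanced)` is true. The last case is a toy version of the counterbalancing scenario mentioned in the abstract, where an initially unstable structure is stabilised by adding an object.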


