Structured Object-Aware Physics Prediction for Video Modeling and Planning

10/06/2019
by   Jannik Kossen, et al.

When humans observe a physical system, they can easily locate objects, understand their interactions, and anticipate future behavior, even in settings with complicated and previously unseen interactions. For computers, however, learning such models from videos in an unsupervised fashion is an unsolved research problem. In this paper, we present STOVE, a novel state-space model for videos, which explicitly reasons about objects and their positions, velocities, and interactions. It is constructed by combining an image model and a dynamics model in a compositional manner and improves on previous work by reusing the dynamics model for inference, accelerating and regularizing training. STOVE predicts videos with convincing physical behavior over hundreds of timesteps, outperforms previous unsupervised models, and even approaches the performance of supervised baselines. We further demonstrate the strength of our model as a simulator for sample-efficient model-based control in a task with heavily interacting objects.
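To make the idea of an object-structured state concrete, the following is a minimal illustrative sketch of a per-object state (position and velocity) advanced by a dynamics step with pairwise interactions. It is not STOVE's learned model: the names `ObjectState`, `pairwise_force`, `step`, and `rollout`, the hand-written repulsion force, and the Euler update are all assumptions chosen for illustration, standing in for the learned dynamics component described in the abstract.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ObjectState:
    """Hypothetical per-object latent state: 2D position and velocity."""
    x: float
    y: float
    vx: float
    vy: float


def pairwise_force(a: ObjectState, b: ObjectState,
                   strength: float = 1.0, radius: float = 1.0) -> Tuple[float, float]:
    """Force on `a` due to `b`: a hand-written short-range repulsion.

    Assumption: this stands in for a learned interaction term.
    """
    dx, dy = a.x - b.x, a.y - b.y
    dist = (dx * dx + dy * dy) ** 0.5
    if dist >= radius or dist == 0.0:
        return 0.0, 0.0
    scale = strength * (radius - dist) / dist
    return scale * dx, scale * dy


def step(states: List[ObjectState], dt: float = 0.1) -> List[ObjectState]:
    """Advance every object one timestep using summed pairwise forces."""
    new_states = []
    for i, a in enumerate(states):
        fx = fy = 0.0
        for j, b in enumerate(states):
            if i != j:
                px, py = pairwise_force(a, b)
                fx += px
                fy += py
        # Update velocity first, then position with the new velocity.
        vx, vy = a.vx + dt * fx, a.vy + dt * fy
        new_states.append(ObjectState(a.x + dt * vx, a.y + dt * vy, vx, vy))
    return new_states


def rollout(states: List[ObjectState], n_steps: int,
            dt: float = 0.1) -> List[List[ObjectState]]:
    """Return the initial states followed by n_steps predicted states."""
    trajectory = [states]
    for _ in range(n_steps):
        states = step(states, dt)
        trajectory.append(states)
    return trajectory
```

In a model of the kind the abstract describes, the object states would be inferred from video frames by an image model and the transition would be learned rather than hand-coded; the sketch only shows how a state factored into objects supports long rollouts by repeated application of one step function.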

