VirtualHome: Simulating Household Activities via Programs

06/19/2018
by   Xavier Puig, et al.
0

In this paper, we are interested in modeling complex activities that occur in a typical household. We propose to use programs, i.e., sequences of atomic actions and interactions, as a high level representation of complex tasks. Programs are interesting because they provide a non-ambiguous representation of a task, and allow agents to execute them. However, nowadays, there is no database providing this type of information. Towards this goal, we first crowd-source programs for a variety of activities that happen in people's homes, via a game-like interface used for teaching kids how to code. Using the collected dataset, we show how we can learn to extract programs directly from natural language descriptions or from videos. We then implement the most common atomic (inter)actions in the Unity3D game engine, and use our programs to "drive" an artificial agent to execute tasks in a simulated household environment. Our VirtualHome simulator allows us to create a large activity video dataset with rich ground-truth, enabling training and testing of video understanding models. We further showcase examples of our agent performing tasks in our VirtualHome based on language descriptions.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 7

page 8

research
06/28/2021

Unsupervised Discovery of Actions in Instructional Videos

In this paper we address the problem of automatically discovering atomic...
research
06/20/2017

Programmable Agents

We build deep RL agents that execute declarative programs expressed in f...
research
12/20/2022

Parsel: A Unified Natural Language Framework for Algorithmic Reasoning

Despite recent success in large language model (LLM) reasoning, LLMs sti...
research
07/31/2020

LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities

Understanding and interpreting human actions is a long-standing challeng...
research
01/18/2014

Location-Based Reasoning about Complex Multi-Agent Behavior

Recent research has shown that surprisingly rich models of human activit...
research
03/30/2022

Learning Program Representations for Food Images and Cooking Recipes

In this paper, we are interested in modeling a how-to instructional proc...
research
10/05/2021

Truth-Conditional Captioning of Time Series Data

In this paper, we explore the task of automatically generating natural l...

Please sign up or login with your details

Forgot password? Click here to reset