Visual Prediction of Priors for Articulated Object Interaction

06/06/2020
by   Caris Moses, et al.
Exploration in novel settings can be challenging without prior experience in similar domains. Humans, however, build on prior experience quickly and efficiently. Children exhibit this behavior when playing with toys. For example, given a toy with a yellow door and a blue door, a child will explore with no clear objective, but once they have discovered how to open the yellow door, they will most likely open the blue door much faster. Adults exhibit the same behavior when entering new spaces such as kitchens. We develop a method, Contextual Prior Prediction, which transfers knowledge between interactions in similar domains through vision. Our agents explore with increasing efficiency by learning visual features that are shared across environments and how those features correlate with actions. We formulate the problem as a Contextual Multi-Armed Bandit in which the contexts are images and the robot has access to a parameterized action space. Given a novel object, the objective is to maximize reward in few interactions. Kinematically constrained mechanisms are a domain that strongly exhibits correlations between visual features and motion. We evaluate our method on simulated prismatic and revolute joints.
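The bandit formulation above can be illustrated with a toy sketch. It is not the authors' method or code: the 2-D feature vector standing in for an image, the linear link between features and the best action, the reward shape, and every function name (`make_mechanism`, `reward`, `explore`, `run`) are assumptions made for illustration. It only shows the general idea of predicting a prior over a continuous action parameter from context, then exploring around that prior on novel objects.

```python
import numpy as np

def make_mechanism(rng):
    """Hypothetical toy mechanism: a 2-D 'visual' feature and its hidden best action."""
    context = rng.uniform(-1.0, 1.0, size=2)
    best_action = 0.8 * context[0] - 0.3 * context[1]  # assumed linear relation
    return context, best_action

def reward(action, best_action):
    """Reward in (0, 1], peaking when the action matches the hidden optimum."""
    return float(np.exp(-10.0 * (action - best_action) ** 2))

def explore(prior_mean, best_action, rng, n_pulls=5, sigma=0.3):
    """Try a few actions sampled around a prior mean; return the best action and its reward."""
    actions = prior_mean + sigma * rng.standard_normal(n_pulls)
    rewards = [reward(a, best_action) for a in actions]
    i = int(np.argmax(rewards))
    return float(actions[i]), rewards[i]

def run(n_train=200, n_test=200, seed=0):
    rng = np.random.default_rng(seed)
    # Training phase: explore with an uninformed prior (mean 0) and
    # record the best action found for each context.
    contexts, best_found = [], []
    for _ in range(n_train):
        c, b = make_mechanism(rng)
        a, _ = explore(0.0, b, rng)
        contexts.append(c)
        best_found.append(a)
    # Fit a linear map from context to a promising action (the learned prior).
    w, *_ = np.linalg.lstsq(np.asarray(contexts), np.asarray(best_found), rcond=None)
    # Test phase: on novel mechanisms, compare exploring around the
    # uninformed prior with exploring around the predicted prior.
    no_prior, with_prior = [], []
    for _ in range(n_test):
        c, b = make_mechanism(rng)
        no_prior.append(explore(0.0, b, rng)[1])
        with_prior.append(explore(float(c @ w), b, rng)[1])
    return float(np.mean(no_prior)), float(np.mean(with_prior))

if __name__ == "__main__":
    baseline, informed = run()
    print(f"mean best reward, uninformed prior: {baseline:.3f}")
    print(f"mean best reward, predicted prior:  {informed:.3f}")
```

With the same budget of five interactions per object, exploring around the predicted prior should reach higher reward on average than exploring around an uninformed one, which is the kind of efficiency gain the abstract describes. A real system would replace the 2-D feature vector with learned image features and the scalar action with the robot's parameterized action space.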

research
12/08/2022

VideoDex: Learning Dexterity from Internet Videos

To build general robotic agents that can operate in many environments, i...
research
09/10/2017

Bayesian bandits: balancing the exploration-exploitation tradeoff via double sampling

Reinforcement learning studies how to balance exploration and exploitati...
research
11/16/2020

Corrupted Contextual Bandits with Action Order Constraints

We consider a variant of the novel contextual bandit problem with corrup...
research
03/23/2020

Contextual Bandit-Based Channel Selection for Wireless LANs with Interference-Driven Feature Extraction

This paper proposes a radio channel selection algorithm based on a conte...
research
06/17/2022

Efficiently Learning Single-Arm Fling Motions to Smooth Garments

Recent work has shown that 2-arm "fling" motions can be effective for ga...
research
04/07/2021

On the Critical Role of Conventions in Adaptive Human-AI Collaboration

Humans can quickly adapt to new partners in collaborative tasks (e.g. pl...
research
07/20/2022

Visual Knowledge Tracing

Each year, thousands of people learn new visual categorization tasks – r...
