We study how vision-language models trained on Internet-scale data can b...
We describe a system for deep reinforcement learning of robotic manipula...
For robots to follow instructions from people, they must be able to conn...
The predictive information, the mutual information between the past and
...
Imitation learning allows agents to learn complex behaviors from
demonst...
We propose an approach to estimating the 3D pose of a hand, possibly han...
Real world data, especially in the domain of robotics, is notoriously co...
Deep Learning methods usually require huge amounts of training data to
p...
Instrumenting and collecting annotated visual grasping datasets to train...
We propose an entirely data-driven approach to estimating the 3D pose of...
We introduce and evaluate several architectures for Convolutional Neural...
Detecting poorly textured objects and estimating their 3D pose reliably ...