For robots to be useful outside labs and specialized factories we need a...
The ability to leverage heterogeneous robotic experience from different
...
We present a novel model for Tracking Any Point (TAP) that effectively t...
We propose a novel multimodal video benchmark - the Perception Test - to...
Recent works have shown that large models pretrained on common visual
le...
Generic motion understanding from video involves not only tracking objec...
For robots operating in the real world, it is desirable to learn reusabl...
Complex sequential tasks in continuous-control settings often require ag...
Self-supervised learning algorithms based on instance discrimination tra...
Imitation learning is an effective tool for robotic learning tasks where...
The sense of touch is fundamental in several manipulation tasks, but rar...
In offline reinforcement learning (RL) agents are trained using a logged...
Behavior cloning (BC) is often practical for robot learning because it a...
We describe a system for large-scale audiovisual translation and dubbing...
We present an approach for estimating the period with which an action is...
Collecting and automatically obtaining reward signals from real robotic
...
We present a framework for data-driven robotics that makes use of a larg...
We introduce a self-supervised representation learning method based on t...
In this paper, we introduce Recipe1M, a new large-scale, structured corp...
Humans are experts at high-fidelity imitation -- closely mimicking a
dem...
Deep reinforcement learning methods traditionally struggle with tasks wh...
In this paper, we introduce a new regularization technique for transfer
...
We capitalize on large amounts of readily-available, synchronous data to...
A person's weight status can have profound implications on their life,
r...
Food is an integral part of our life and what and how much we eat crucia...
People can recognize scenes across many different modalities beyond natu...
We learn rich natural sound representations by capitalizing on large amo...
Age and gender are complementary soft biometric traits for face recognit...
People can recognize scenes across many different modalities beyond natu...