We study how vision-language models trained on Internet-scale data can b...
We consider how to most efficiently leverage teleoperator time to collec...
Large foundation models can exhibit unique capabilities depending on the...
Robotic manipulation can be formulated as inducing a sequence of spatial...
Skilled robotic manipulation benefits from complex synergies between
non...