We study how vision-language models trained on Internet-scale data can b...
Recent advances in robot learning have shown promise in enabling robots ...
By transferring knowledge from large, diverse, task-agnostic datasets, m...
In recent years, much progress has been made in learning robotic manipul...
Large language models can encode a wealth of semantic knowledge about th...
It has been recognized that the joint training of computer vision tasks ...