Effectively localizing an agent in a realistic, noisy setting is crucial...
To achieve autonomy in a priori unknown real-world scenarios, agents sho...
When developing deep learning models, we usually decide what task we wan...
We propose a pre-training strategy called Multi-modal Multi-task Masked ...
Abstraction is at the heart of sketching due to the simple and minimal n...
Transfer learning has witnessed remarkable progress in recent years, for...
This paper introduces a pipeline to parametrically sample and render mul...
We present a method for making neural network predictions robust to shif...
Vision-based robotics often separates the control loop into one module f...
Visual perception entails solving a wide set of tasks, e.g., object dete...
When training a neural network for a desired task, one may prefer to ada...
How much does having visual priors about the world (e.g. the fact that t...
Developing visual perception models for active agents and sensorimotor c...
Do visual tasks have a relationship, or are they unrelated? For instance...
Human communication typically has an underlying structure. This is refle...