Objects play a crucial role in our everyday activities. Though multisens...
We introduce the visual acoustic matching task, in which an audio clip i...
Binaural audio provides human listeners with an immersive spatial sound
...
Multisensory object-centric perception, reasoning, and interaction have ...
We introduce a new approach for audio-visual speech separation. Given a
...
In audio-visual navigation, an agent intelligently travels through a com...
Several animal species (e.g., bats, dolphins, and whales) and even visua...
In the face of the video data deluge, today's expensive clip-level
class...
Learning how objects sound from video is challenging, since they often
h...
Binaural audio provides a listener with 3D sound sensation, allowing a r...
Perceiving a scene most fully requires all the senses. Yet modeling how
...
Existing methods to recognize actions in static images take the images a...
While machine learning approaches to image restoration offer great promi...
Supervised (pre-)training currently yields state-of-the-art performance ...