Vasu Sharma | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Jitendra Malik
139 publications
Julien Mairal
76 publications
Gabriel Synnaeve
73 publications
Armand Joulin
70 publications
Dilek Hakkani-Tur
65 publications
Mike Lewis
64 publications
Gaurav S. Sukhatme
60 publications
Hervé Jégou
59 publications
Katia Sycara
56 publications
Ishan Misra
46 publications
Piotr Bojanowski
41 publications

research

∙ 04/14/2023

DINOv2: Learning Robust Visual Features without Supervision

The recent breakthroughs in natural language processing for model pretra...

1 Maxime Oquab, et al. ∙

research

∙ 03/02/2023

Alexa Arena: A User-Centric Interactive Platform for Embodied AI

We introduce Alexa Arena, a user-centric simulation platform for Embodie...

0 Qiaozi Gao, et al. ∙

research

∙ 12/15/2022

MAViL: Masked Audio-Video Learners

We present Masked Audio-Video Learners (MAViL) to train audio-visual rep...

0 Po-Yao Huang, et al. ∙

research

∙ 08/26/2022

CH-MARL: A Multimodal Benchmark for Cooperative, Heterogeneous Multi-Agent Reinforcement Learning

We propose a multimodal (vision-and-language) benchmark for cooperative ...

0 Vasu Sharma, et al. ∙

research

∙ 08/10/2018

Community Regularization of Visually-Grounded Dialog

The task of conducting visually grounded dialog involves learning goal-o...

0 Akshat Agarwal, et al. ∙

research

∙ 08/10/2018

Mind Your Language: Learning Visually Grounded Dialog in a Multi-Agent Setting

The task of visually grounded dialog involves learning goal-oriented coo...

0 Akshat Agarwal, et al. ∙

Success!

An error occurred