Bhavan Jasani | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Deva Ramanan
104 publications
Nuno Vasconcelos
58 publications
Rohit Girdhar
30 publications
Kashyap Chitta
20 publications
R. Manmatha
19 publications
Srikar Appalaraju
13 publications
Chih-Hui Ho
11 publications
Yusheng Xie
11 publications
Yash Patel
9 publications
Bhargava Urala Kota
2 publications
Afshaan Mazagonwalla
1 publication

research

∙ 11/15/2022

YORO – Lightweight End to End Visual Grounding

We present YORO - a multi-modal transformer encoder-only architecture fo...

0 Chih-Hui Ho, et al. ∙

research

∙ 06/22/2021

DocFormer: End-to-End Transformer for Document Understanding

We present DocFormer – a multi-modal transformer based architecture for ...

1 Srikar Appalaraju, et al. ∙

research

∙ 11/26/2019

Skeleton based Zero Shot Action Recognition in Joint Pose-Language Semantic Space

How does one represent an action? How does one describe an action that w...

0 Bhavan Jasani, et al. ∙

research

∙ 11/08/2019

Are we asking the right questions in MovieQA?

Joint vision and language tasks like visual question answering are fasci...

0 Bhavan Jasani, et al. ∙

research

∙ 05/19/2018

Learning Sampling Policies for Domain Adaptation

We address the problem of semi-supervised domain adaptation of classific...

0 Yash Patel, et al. ∙

Success!

An error occurred