Samir Yitzhak Gadre

research

∙ 07/19/2023

Improving Multimodal Datasets with Image Captioning

Massive web datasets play a key role in the success of large vision-lang...

0 Thao Nguyen, et al. ∙

research

∙ 07/11/2023

Objaverse-XL: A Universe of 10M+ 3D Objects

Natural language processing and 2D vision models have attained remarkabl...

1 Matt Deitke, et al. ∙

research

∙ 04/27/2023

DataComp: In search of the next generation of multimodal datasets

Large multimodal datasets have been instrumental in recent breakthroughs...

0 Samir Yitzhak Gadre, et al. ∙

research

∙ 08/10/2022

Patching open-vocabulary models by interpolating weights

Open-vocabulary models like CLIP achieve high accuracy across many image...

10 Gabriel Ilharco, et al. ∙

research

∙ 07/19/2022

Structure from Action: Learning Interactions for Articulated Object 3D Structure Discovery

Articulated objects are abundant in daily life. Discovering their parts,...

0 Neil Nie, et al. ∙

research

∙ 03/31/2022

Continuous Scene Representations for Embodied AI

We propose Continuous Scene Representations (CSR), a scene representatio...

0 Samir Yitzhak Gadre, et al. ∙

research

∙ 03/20/2022

CLIP on Wheels: Zero-Shot Object Navigation as Object Localization and Exploration

Households across the world contain arbitrary objects: from mate gourds ...

5 Samir Yitzhak Gadre, et al. ∙

research

∙ 03/10/2022

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

The conventional recipe for maximizing model accuracy is to (1) train mu...

10 Mitchell Wortsman, et al. ∙

research

∙ 05/03/2021

Act the Part: Learning Interaction Strategies for Articulated Object Part Discovery

People often use physical intuition when manipulating articulated object...

0 Samir Yitzhak Gadre, et al. ∙

Samir Yitzhak Gadre

Featured Co-authors

Sign in with Google

Consider DeepAI Pro