Antonio Torralba

research

∙ 09/07/2023

A Function Interpretation Benchmark for Evaluating Interpretability Methods

Labeling neural network submodules with human-legible descriptions is us...

0 Sarah Schwettmann, et al. ∙

research

∙ 08/10/2023

Follow Anything: Open-set detection, tracking, and following in real-time

Tracking and following objects of interest is critical to several roboti...

0 Alaa Maalouf, et al. ∙

research

∙ 08/03/2023

Multimodal Neurons in Pretrained Text-Only Transformers

Language models demonstrate remarkable capacity to generalize representa...

0 Sarah Schwettmann, et al. ∙

research

∙ 07/14/2023

DreamTeacher: Pretraining Image Backbones with Deep Generative Models

In this work, we introduce a self-supervised feature representation lear...

0 Daiqing Li, et al. ∙

research

∙ 06/08/2023

Background Prompting for Improved Object Depth

Estimating the depth of objects from a single image is a valuable task f...

0 Manel Baradad, et al. ∙

research

∙ 06/08/2023

Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models

Text-to-image generative models have enabled high-resolution image synth...

19 Nan Liu, et al. ∙

research

∙ 05/23/2023

Improving Factuality and Reasoning in Language Models through Multiagent Debate

Large language models (LLMs) have demonstrated remarkable capabilities i...

4 Yilun Du, et al. ∙

research

∙ 05/02/2023

Generalizing Dataset Distillation via Deep Generative Prior

Dataset Distillation aims to distill an entire dataset's knowledge into ...

0 George Cazenavette, et al. ∙

research

∙ 04/22/2023

3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes

Given a visual scene, humans have strong intuitions about how a scene ca...

0 Haotian Xue, et al. ∙

research

∙ 04/19/2023

NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models

Automatically generating high-quality real world 3D scenes is of enormou...

0 Seung Wook Kim, et al. ∙

research

∙ 04/03/2023

Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning

In goal-reaching reinforcement learning (RL), the optimal value function...

0 Tongzhou Wang, et al. ∙

research

∙ 03/29/2023

Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos

Modeling sounds emitted from physical object interactions is critical fo...

0 Kun Su, et al. ∙

research

∙ 03/21/2023

Detecting Everything in the Open World: Towards Universal Object Detection

In this paper, we formally address universal object detection, which aim...

0 Zhenyu Wang, et al. ∙

research

∙ 03/20/2023

Open-vocabulary Panoptic Segmentation with Embedding Modulation

Open-vocabulary image segmentation is attracting increasing attention du...

3 Xi Chen, et al. ∙

research

∙ 02/14/2023

ConceptFusion: Open-set Multimodal 3D Mapping

Building 3D maps of the environment is central to robot navigation, plan...

17 Krishna Murthy Jatavallabhula, et al. ∙

research

∙ 01/31/2023

Debiasing Vision-Language Models via Biased Prompts

Machine learning models have been shown to inherit biases from their tra...

0 Ching-Yao Chuang, et al. ∙

research

∙ 01/12/2023

NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants

In this work, we study how to build socially intelligent robots to assis...

9 Xavier Puig, et al. ∙

research

∙ 12/22/2022

Aliasing is a Driver of Adversarial Attacks

Aliasing is a highly important concept in signal processing, as careful ...

0 Adrián Rodríguez-Muñoz, et al. ∙

research

∙ 11/29/2022

Procedural Image Programs for Representation Learning

Learning image representations using synthetic data allows training neur...

0 Manel Baradad, et al. ∙

research

∙ 11/08/2022

BT^2: Backward-compatible Training with Basis Transformation

Modern retrieval system often requires recomputing the representation of...

0 Yifei Zhou, et al. ∙

research

∙ 10/20/2022

Composing Ensembles of Pre-trained Models via Iterative Consensus

Large pre-trained models exhibit distinct and complementary capabilities...

1 Shuang Li, et al. ∙

research

∙ 09/26/2022

Totems: Physical Objects for Verifying Visual Integrity

We introduce a new approach to image forensics: placing physical refract...

4 Jingwei Ma, et al. ∙

research

∙ 07/07/2022

Finding Fallen Objects Via Asynchronous Audio-Visual Integration

The way an object looks and sounds provide complementary reflections of ...

6 Chuang Gan, et al. ∙

research

∙ 07/06/2022

Local Relighting of Real Scenes

We introduce the task of local relighting, which changes a photograph of...

0 Audrey Cui, et al. ∙

research

∙ 06/30/2022

Denoised MDPs: Learning World Models Better Than the World Itself

The ability to separate signal from noise, and reason with clean abstrac...

8 Tongzhou Wang, et al. ∙

research

∙ 06/16/2022

Virtual Correspondence: Humans as a Cue for Extreme-View Geometry

Recovering the spatial layout of the cameras and the geometry of the sce...

0 Wei-Chiu Ma, et al. ∙

research

∙ 06/15/2022

Disentangling visual and written concepts in CLIP

The CLIP network measures the similarity between natural text and images...

8 Joanna Materzyńska, et al. ∙

research

∙ 06/06/2022

Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps

Modern image generative models show remarkable sample quality when train...

7 Seung Wook Kim, et al. ∙

research

∙ 06/03/2022

Compositional Visual Generation with Composable Diffusion Models

Large text-guided diffusion models, such as DALLE-2, are able to generat...

8 Nan Liu, et al. ∙

research

∙ 05/05/2022

Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction

This paper studies the problem of fixing malfunctional 3D objects. While...

15 Yining Hong, et al. ∙

research

∙ 05/02/2022

ComPhy: Compositional Physical Reasoning of Objects and Events from Videos

Objects' motions in nature are governed by complex interactions and thei...

0 Zhenfang Chen, et al. ∙

research

∙ 04/11/2022

Correcting Robot Plans with Natural Language Feedback

When humans design cost or goal specifications for robots, they often pr...

2 Pratyusha Sharma, et al. ∙

research

∙ 04/04/2022

Learning Neural Acoustic Fields

Our environment is filled with rich and dynamic acoustic information. Wh...

3 Andrew Luo, et al. ∙

research

∙ 03/30/2022

Learning Program Representations for Food Images and Cooking Recipes

In this paper, we are interested in modeling a how-to instructional proc...

0 Dim P. Papadopoulos, et al. ∙

research

∙ 03/22/2022

Dataset Distillation by Matching Training Trajectories

Dataset distillation is the task of synthesizing a small dataset such th...

16 George Cazenavette, et al. ∙

research

∙ 02/03/2022

Pre-Trained Language Models for Interactive Decision-Making

Language model (LM) pre-training has proven useful for a wide variety of...

68 Shuang Li, et al. ∙

research

∙ 01/26/2022

Natural Language Descriptions of Deep Visual Features

Some neurons in deep networks specialize in recognizing highly specific ...

2 Evan Hernandez, et al. ∙

research

∙ 01/12/2022

BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations

Annotating images with pixel-wise labels is a time-consuming and costly ...

11 Daiqing Li, et al. ∙

research

∙ 01/12/2022

Robust Contrastive Learning against Noisy Views

Contrastive learning relies on an assumption that positive pairs contain...

12 Ching-Yao Chuang, et al. ∙

research

∙ 01/11/2022

Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents

Natural disasters, such as floods, tornadoes, or wildfires, are increasi...

26 Ethan Weber, et al. ∙

research

∙ 12/09/2021

GAN-Supervised Dense Visual Alignment

We propose GAN-Supervised Learning, a framework for learning discriminat...

7 William Peebles, et al. ∙

research

∙ 12/09/2021

PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning

A critical aspect of human visual perception is the ability to parse vis...

28 Yining Hong, et al. ∙

research

∙ 12/02/2021

Editing a classifier by rewriting its prediction rules

We present a methodology for modifying the behavior of a classifier by d...

13 Shibani Santurkar, et al. ∙

research

∙ 11/17/2021

Learning to Compose Visual Relations

The visual world around us can be described as a structured set of objec...

22 Nan Liu, et al. ∙

research

∙ 11/04/2021

EditGAN: High-Precision Semantic Image Editing

Generative adversarial networks (GANs) have recently found applications ...

8 Huan Ling, et al. ∙

research

∙ 10/13/2021

OPEn: An Open-ended Physics Environment for Learning Without a Task

Humans have mental models that allow them to plan, experiment, and reaso...

57 Chuang Gan, et al. ∙

research

∙ 10/08/2021

Toward a Visual Concept Vocabulary for GAN Latent Space

A large body of recent work has identified transformations in the latent...

11 Sarah Schwettmann, et al. ∙

research

∙ 10/07/2021

Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions

We introduce the task of weakly supervised learning for detecting human ...

11 Shuang Li, et al. ∙

research

∙ 10/05/2021

Scaling up instance annotation via label propagation

Manually annotating object segmentation masks is very time-consuming. Wh...

0 Dim P. Papadopoulos, et al. ∙

research

∙ 10/04/2021

Skill Induction and Planning with Latent Language

We present a framework for learning hierarchical policies from demonstra...

0 Pratyusha Sharma, et al. ∙

Antonio Torralba

Featured Co-authors

Sign in with Google

Consider DeepAI Pro