Justin Johnson

research

∙ 07/14/2023

NIFTY: Neural Object Interaction Fields for Guided Human Motion Synthesis

We address the problem of generating realistic 3D motions of humans inte...

0 Nilesh Kulkarni, et al. ∙

research

∙ 06/14/2023

Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data

We introduce a method that can learn to predict scene-level implicit fun...

0 Nilesh Kulkarni, et al. ∙

research

∙ 06/12/2023

Scalable 3D Captioning with Pretrained Models

We introduce Cap3D, an automatic approach for generating descriptive tex...

0 Tiange Luo, et al. ∙

research

∙ 04/18/2023

Hyperbolic Image-Text Representations

Visual and linguistic concepts naturally organize themselves in a hierar...

0 Karan Desai, et al. ∙

research

∙ 03/21/2023

Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models

We present Text2Room, a method for generating room-scale textured 3D mes...

0 Lukas Höllein, et al. ∙

research

∙ 02/23/2023

Learning Visual Representations via Language-Guided Sampling

Although an object may appear in numerous contexts, we often describe it...

0 Mohamed El Banani, et al. ∙

research

∙ 01/23/2023

HexPlane: A Fast Representation for Dynamic Scenes

Modeling and re-rendering dynamic 3D scenes is a challenging task in 3D ...

0 Ang Cao, et al. ∙

research

∙ 01/19/2023

Multiview Compressive Coding for 3D Reconstruction

A central goal of visual recognition is to understand objects and scenes...

11 Chao-Yuan Wu, et al. ∙

research

∙ 12/25/2022

Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program

3D shapes have complementary abstractions from low-level geometry to par...

7 Tiange Luo, et al. ∙

research

∙ 12/06/2022

Self-Supervised Correspondence Estimation via Multiview Registration

Video provides us with the spatio-temporal consistency needed for visual...

0 Mohamed El Banani, et al. ∙

research

∙ 11/29/2022

RGB no more: Minimally-decoded JPEG Vision Transformers

Most neural networks for computer vision are designed to infer using RGB...

0 Jeongsoo Park, et al. ∙

research

∙ 08/18/2022

The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs

We present a simple baseline for directly estimating the relative pose (...

9 Chris Rockwell, et al. ∙

research

∙ 07/21/2022

Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild

Recognizing scenes and objects in 3D from a single image is a longstandi...

0 Garrick Brazil, et al. ∙

research

∙ 06/16/2022

FWD: Real-time Novel View Synthesis with Forward Warping and Depth

Novel view synthesis (NVS) is a challenging task requiring systems to ge...

0 Ang Cao, et al. ∙

research

∙ 06/14/2022

Learning 3D Object Shape and Layout without 3D Supervision

A 3D scene consists of a set of objects, each with a shape and a layout ...

0 Georgia Gkioxari, et al. ∙

research

∙ 12/08/2021

What's Behind the Couch? Directed Ray Distance Functions (DRDF) for 3D Scene Reconstruction

We present an approach for full 3D scene reconstruction from a single un...

0 Nilesh Kulkarni, et al. ∙

research

∙ 12/02/2021

StyleMesh: Style Transfer for Indoor 3D Scene Reconstructions

We apply style transfer on mesh reconstructions of indoor scenes. This e...

0 Lukas Höllein, et al. ∙

research

∙ 12/02/2021

Recognizing Scenes from Novel Viewpoints

Humans can perceive scenes in 3D from a handful of 2D views. For AI agen...

0 Shengyi Qian, et al. ∙

research

∙ 11/22/2021

RedCaps: web-curated image-text data created by the people, for the people

Large datasets of paired images and text have become increasingly popula...

4 Karan Desai, et al. ∙

research

∙ 08/12/2021

PixelSynth: Generating a 3D-Consistent Experience from a Single Image

Recent advancements in differentiable rendering and 3D reasoning have dr...

4 Chris Rockwell, et al. ∙

research

∙ 06/26/2021

Inverting and Understanding Object Detectors

As a core problem in computer vision, the performance of object detectio...

16 Ang Cao, et al. ∙

research

∙ 06/01/2021

Bootstrap Your Own Correspondences

Geometric feature extraction is a crucial component of point cloud regis...

5 Mohamed El Banani, et al. ∙

research

∙ 05/17/2021

Rethinking "Batch" in BatchNorm

BatchNorm is a critical building block in modern convolutional neural ne...

0 Yuxin Wu, et al. ∙

research

∙ 02/23/2021

UnsupervisedR R: Unsupervised Point Cloud Registration via Differentiable Rendering

Aligning partial views of a scene into a single whole is essential to un...

0 Mohamed El Banani, et al. ∙

research

∙ 12/08/2020

CASTing Your Model: Learning to Localize Improves Self-Supervised Representations

Recent advances in self-supervised learning (SSL) have largely closed th...

13 Ramprasaath R. Selvaraju, et al. ∙

research

∙ 07/16/2020

Accelerating 3D Deep Learning with PyTorch3D

Deep learning has significantly improved 2D image recognition. Extending...

61 Nikhila Ravi, et al. ∙

research

∙ 06/11/2020

VirTex: Learning Visual Representations from Textual Annotations

The de-facto approach to many vision tasks is to start from pretrained v...

16 Karan Desai, et al. ∙

research

∙ 12/18/2019

SynSin: End-to-end View Synthesis from a Single Image

Single image view synthesis allows for the generation of new views of a ...

17 Olivia Wiles, et al. ∙

research

∙ 11/21/2019

Temporal Reasoning via Audio Question Answering

Multimodal question answering tasks can be used as proxy tasks to study ...

0 Haytham M. Fayek, et al. ∙

research

∙ 08/15/2019

PHYRE: A New Benchmark for Physical Reasoning

Understanding and reasoning about physics is an important ability of int...

6 Anton Bakhtin, et al. ∙

research

∙ 06/06/2019

Mesh R-CNN

Rapid advances in 2D perception have led to systems that accurately dete...

5 Georgia Gkioxari, et al. ∙

research

∙ 05/30/2019

On Network Design Spaces for Visual Recognition

Over the past several years progress in designing better neural network ...

16 Ilija Radosavovic, et al. ∙

research

∙ 07/26/2018

HiDDeN: Hiding Data With Deep Networks

Recent work has shown that deep neural networks are highly sensitive to ...

0 Jiren Zhu, et al. ∙

research

∙ 04/04/2018

Image Generation from Scene Graphs

To truly understand the visual world our models should be able not only ...

0 Justin Johnson, et al. ∙

research

∙ 03/30/2018

DDRprog: A CLEVR Differentiable Dynamic Reasoning Programmer

We present a novel Dynamic Differentiable Reasoning (DDR) framework for ...

0 Joseph Suarez, et al. ∙

research

∙ 03/29/2018

Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks

Understanding human motion behavior is critical for autonomous moving pl...

0 Agrim Gupta, et al. ∙

research

∙ 05/10/2017

Inferring and Executing Programs for Visual Reasoning

Existing methods for visual reasoning attempt to directly map inputs to ...

0 Justin Johnson, et al. ∙

research

∙ 05/05/2017

Characterizing and Improving Stability in Neural Style Transfer

Recent progress in style transfer on images has focused on improving the...

0 Agrim Gupta, et al. ∙

research

∙ 12/20/2016

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

When building artificial intelligence systems that can reason and answer...

0 Justin Johnson, et al. ∙

research

∙ 11/20/2016

A Hierarchical Approach for Generating Descriptive Image Paragraphs

Recent progress on image captioning has made it possible to generate nov...

0 Jonathan Krause, et al. ∙

research

∙ 03/27/2016

Perceptual Losses for Real-Time Style Transfer and Super-Resolution

We consider image transformation problems, where an input image is trans...

0 Justin Johnson, et al. ∙

research

∙ 02/23/2016

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

Despite progress in perceptual tasks such as image classification, compu...

0 Ranjay Krishna, et al. ∙

research

∙ 11/24/2015

DenseCap: Fully Convolutional Localization Networks for Dense Captioning

We introduce the dense captioning task, which requires a computer vision...

1 Justin Johnson, et al. ∙

research

∙ 08/30/2015

Love Thy Neighbors: Image Annotation by Exploiting Image Metadata

Some images that are difficult to recognize on their own may become more...

1 Justin Johnson, et al. ∙

research

∙ 06/05/2015

Visualizing and Understanding Recurrent Networks

Recurrent Neural Networks (RNNs), and specifically a variant with Long S...

0 Andrej Karpathy, et al. ∙

Justin Johnson

Featured Co-authors

Sign in with Google

Consider DeepAI Pro