Anima Anandkumar

research

∙ 09/01/2023

Geometry-Informed Neural Operator for Large-Scale 3D PDEs

We propose the geometry-informed neural operator (GINO), a highly effici...

0 Zongyi Li, et al. ∙

research

∙ 08/17/2023

Tipping Point Forecasting in Non-Stationary Dynamics on Function Spaces

Tipping points are abrupt, drastic, and often irreversible changes in th...

0 Miguel Liu-Schiaffini, et al. ∙

research

∙ 08/04/2023

FB-BEV: BEV Representation from Forward-Backward View Transformations

View Transformation Module (VTM), where transformations happen between m...

0 Zhiqi Li, et al. ∙

research

∙ 07/27/2023

Speeding up Fourier Neural Operators via Mixed Precision

The Fourier neural operator (FNO) is a powerful technique for learning s...

0 Colin White, et al. ∙

research

∙ 07/27/2023

Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs

Deep learning often faces the challenge of efficiently processing dynami...

0 Or Sharir, et al. ∙

research

∙ 06/27/2023

LeanDojo: Theorem Proving with Retrieval-Augmented Language Models

Large language models (LLMs) have shown promise in proving formal theore...

0 Kaiyu Yang, et al. ∙

research

∙ 06/20/2023

InRank: Incremental Low-Rank Learning

The theory of greedy low-rank learning (GLRL) aims to explain the impres...

10 Jiawei Zhao, et al. ∙

research

∙ 06/15/2023

Fast Training of Diffusion Models with Masked Transformers

We propose an efficient approach to train large diffusion models with ma...

14 Hongkai Zheng, et al. ∙

research

∙ 06/14/2023

ClimSim: An open large-scale dataset for training high-resolution physics emulators in hybrid multi-scale climate simulators

Modern climate projections lack adequate spatial and temporal resolution...

8 Sungduk Yu, et al. ∙

research

∙ 06/06/2023

Spherical Fourier Neural Operators: Learning Stable Dynamics on the Sphere

Fourier Neural Operators (FNOs) have proven to be an efficient and effec...

6 Boris Bonev, et al. ∙

research

∙ 05/29/2023

Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo

We present a scalable and effective exploration strategy based on Thomps...

3 Haque Ishfaq, et al. ∙

research

∙ 05/25/2023

Voyager: An Open-Ended Embodied Agent with Large Language Models

We introduce Voyager, the first LLM-powered embodied lifelong learning a...

16 Guanzhi Wang, et al. ∙

research

∙ 05/22/2023

Fast Monocular Scene Reconstruction with Global-Sparse Local-Dense Grids

Indoor scene reconstruction from monocular images has long been sought a...

2 Wei Dong, et al. ∙

research

∙ 04/13/2023

Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study

Large decoder-only language models (LMs) can be largely improved in term...

16 Boxin Wang, et al. ∙

research

∙ 03/04/2023

Prismer: A Vision-Language Model with An Ensemble of Experts

Recent vision-language models have shown impressive multi-modal generati...

12 Shikun Liu, et al. ∙

research

∙ 02/23/2023

VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion

Humans can easily imagine the complete 3D geometry of occluded objects a...

11 Yiming Li, et al. ∙

research

∙ 02/14/2023

Score-based Diffusion Models in Function Space

Diffusion models have recently emerged as a powerful framework for gener...

0 Jae Hyun Lim, et al. ∙

research

∙ 02/14/2023

AutoBiasTest: Controllable Sentence Generation for Automated and Open-Ended Social Bias Testing in Language Models

Social bias in Pretrained Language Models (PLMs) affects text generation...

35 Rafal Kocielnik, et al. ∙

research

∙ 02/13/2023

PerAda: Parameter-Efficient and Generalizable Federated Learning Personalization with Guarantees

Personalized Federated Learning (pFL) has emerged as a promising solutio...

1 Chulin Xie, et al. ∙

research

∙ 02/12/2023

I^2SB: Image-to-Image Schrödinger Bridge

We propose Image-to-Image Schrödinger Bridge (I^2SB), a new class of con...

6 Guan-Horng Liu, et al. ∙

research

∙ 02/09/2023

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning

Augmenting pretrained language models (LMs) with a vision encoder (e.g.,...

0 Zhuolin Yang, et al. ∙

research

∙ 01/19/2023

Forecasting subcritical cylinder wakes with Fourier Neural Operators

We apply Fourier neural operators (FNOs), a state-of-the-art operator le...

33 Peter I Renn, et al. ∙

research

∙ 01/10/2023

Vision Transformers Are Good Mask Auto-Labelers

We propose Mask Auto-Labeler (MAL), a high-quality Transformer-based mas...

13 Shiyi Lan, et al. ∙

research

∙ 12/21/2022

Towards Neural Variational Monte Carlo That Scales Linearly with System Size

Quantum many-body problems are some of the most challenging problems in ...

17 Or Sharir, et al. ∙

research

∙ 11/30/2022

HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression

Transformers have attained superior performance in natural language proc...

17 Jiaqi Gu, et al. ∙

research

∙ 11/29/2022

Fourier Continuation for Exact Derivative Computation in Physics-Informed Neural Operators

The physics-informed neural operator (PINO) is a machine learning archit...

15 Haydn Maust, et al. ∙

research

∙ 11/28/2022

Incremental Fourier Neural Operator

Recently, neural networks have proven their impressive ability to solve ...

6 Jiawei Zhao, et al. ∙

research

∙ 11/28/2022

Machine Learning Accelerated PDE Backstepping Observers

State estimation is important for a variety of tasks, from forecasting t...

8 Yuanyuan Shi, et al. ∙

research

∙ 11/24/2022

Fast Sampling of Diffusion Models via Operator Learning

Diffusion models have found widespread adoption in various areas. Howeve...

17 Hongkai Zheng, et al. ∙

research

∙ 11/21/2022

Can You Label Less by Using Out-of-Domain Data? Active Transfer Learning with Few-shot Instructions

Labeling social-media data for custom dimensions of toxicity and social ...

32 Rafal Kocielnik, et al. ∙

research

∙ 11/01/2022

DensePure: Understanding Diffusion Models towards Adversarial Robustness

Diffusion models have been recently employed to improve certified robust...

7 Chaowei Xiao, et al. ∙

research

∙ 10/31/2022

Accelerating Carbon Capture and Storage Modeling using Fourier Neural Operators

Carbon capture and storage (CCS) is an important strategy for reducing c...

17 Gege Wen, et al. ∙

research

∙ 10/27/2022

An Adversarial Active Sampling-based Data Augmentation Framework for Manufacturable Chip Design

Lithography modeling is a crucial problem in chip design to ensure a chi...

15 Mingjie Liu, et al. ∙

research

∙ 10/23/2022

1st Place Solution of The Robust Vision Challenge (RVC) 2022 Semantic Segmentation Track

This report describes the winning solution to the semantic segmentation ...

20 Junfei Xiao, et al. ∙

research

∙ 10/12/2022

Context Generation Improves Open Domain Question Answering

Closed-book question answering (QA) requires a model to directly answer ...

12 Dan Su, et al. ∙

research

∙ 09/30/2022

Dynamic-Backbone Protein-Ligand Structure Prediction with Multiscale Generative Diffusion Models

Molecular complexes formed by proteins and small-molecule ligands are ub...

31 Zhuoran Qiao, et al. ∙

research

∙ 09/19/2022

AdvDO: Realistic Adversarial Attacks for Trajectory Prediction

Trajectory prediction is essential for autonomous vehicles (AVs) to plan...

27 Yulong Cao, et al. ∙

research

∙ 09/15/2022

Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models

Pre-trained vision-language models (e.g., CLIP) have shown promising zer...

29 Manli Shu, et al. ∙

research

∙ 08/23/2022

Retrieval-based Controllable Molecule Generation

Generating new molecules with specified chemical and biological properti...

21 Zichao Wang, et al. ∙

research

∙ 08/03/2022

MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training

We propose MinVIS, a minimal video instance segmentation (VIS) framework...

25 De-An Huang, et al. ∙

research

∙ 07/29/2022

Robust Trajectory Prediction against Adversarial Attacks

Trajectory prediction using deep neural networks (DNNs) is an essential ...

12 Yulong Cao, et al. ∙

research

∙ 07/11/2022

Fourier Neural Operator with Learned Deformations for PDEs on General Geometries

Deep learning surrogate models have shown promise in solving partial dif...

20 Zongyi Li, et al. ∙

research

∙ 07/08/2022

Large Scale Mask Optimization Via Convolutional Fourier Neural Operator and Litho-Guided Self Training

Machine learning techniques have been extensively studied for mask optim...

20 Haoyu Yang, et al. ∙

research

∙ 06/22/2022

Langevin Monte Carlo for Contextual Bandits

We study the efficiency of Thompson sampling for contextual bandits. Exi...

23 Pan Xu, et al. ∙

research

∙ 06/17/2022

MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Autonomous agents have made great strides in specialist domains like Ata...

19 Linxi Fan, et al. ∙

research

∙ 06/17/2022

Thompson Sampling Achieves Õ(√(T)) Regret in Linear Quadratic Control

Thompson Sampling (TS) is an efficient method for decision-making under ...

31 Taylan Kargin, et al. ∙

research

∙ 06/07/2022

Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits

We study the regret of Thompson sampling (TS) algorithms for exponential...

8 Tianyuan Jin, et al. ∙

research

∙ 06/03/2022

KCRL: Krasovskii-Constrained Reinforcement Learning with Guaranteed Stability in Nonlinear Dynamical Systems

Learning a dynamical system requires stabilizing the unknown dynamics to...

17 Sahin Lale, et al. ∙

research

∙ 05/27/2022

Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions

A significant gap remains between today's visual pattern recognition mod...

16 Huaizu Jiang, et al. ∙

research

∙ 05/16/2022

Diffusion Models for Adversarial Purification

Adversarial purification refers to a class of defense methods that remov...

38 Weili Nie, et al. ∙

Anima Anandkumar

Featured Co-authors

Sign in with Google

Consider DeepAI Pro