David Duvenaud

research

∙ 07/02/2023

Tools for Verifying Neural Models' Training Data

It is important that consumers and regulators can verify the provenance ...

0 Dami Choi, et al. ∙

research

∙ 12/28/2022

On Implicit Bias in Overparameterized Bilevel Optimization

Many problems in machine learning involve bilevel optimization (BLO), in...

0 Paul Vicol, et al. ∙

research

∙ 11/02/2021

Meta-Learning to Improve Pre-Training

Pre-training (PT) followed by fine-tuning (FT) is an effective method fo...

14 Aniruddh Raghu, et al. ∙

research

∙ 04/12/2021

Getting to the Point. Index Sets and Parallelism-Preserving Autodiff for Pointful Array Programming

We present a novel programming language design that attempts to combine ...

0 Adam Paszke, et al. ∙

research

∙ 02/16/2021

Complex Momentum for Learning in Games

We generalize gradient descent with momentum for learning in differentia...

21 Jonathan Lorraine, et al. ∙

research

∙ 02/12/2021

Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations

We perform scalable approximate inference in a recently-proposed family ...

24 Winnie Xu, et al. ∙

research

∙ 02/08/2021

Oops I Took A Gradient: Scalable Sampling for Discrete Distributions

We propose a general and scalable approximate sampling strategy for prob...

2 Will Grathwohl, et al. ∙

research

∙ 11/09/2020

Self-Tuning Stochastic Optimization with Curvature-Aware Gradient Filtering

Standard first-order stochastic optimization algorithms base their updat...

0 Ricky T. Q. Chen, et al. ∙

research

∙ 11/05/2020

Teaching with Commentaries

Effective training of deep neural networks can be challenging, and there...

3 Aniruddh Raghu, et al. ∙

research

∙ 10/08/2020

No MCMC for me: Amortized sampling for fast and stable training of energy-based models

Energy-Based Models (EBMs) present a flexible and appealing way to repre...

6 Will Grathwohl, et al. ∙

research

∙ 07/09/2020

A Study of Gradient Variance in Deep Learning

The impact of gradient noise on training deep models is widely acknowled...

20 Fartash Faghri, et al. ∙

research

∙ 07/09/2020

Learning Differential Equations that are Easy to Solve

Differential equations parameterized by neural networks become expensive...

60 Jacob Kelly, et al. ∙

research

∙ 04/01/2020

SUMO: Unbiased Estimation of Log Marginal Probability for Latent Variable Models

Standard variational lower bounds used to train latent variable models p...

8 Yucen Luo, et al. ∙

research

∙ 03/05/2020

What went wrong and when? Instance-wise Feature Importance for Time-series Models

Multivariate time series models are poised to be used for decision suppo...

17 Sana Tonekaboni, et al. ∙

research

∙ 02/13/2020

Cutting out the Middle-Man: Training and Evaluating Energy-Based Models without Sampling

We present a new method for evaluating and training unnormalized density...

1 Will Grathwohl, et al. ∙

research

∙ 01/05/2020

Scalable Gradients for Stochastic Differential Equations

The adjoint sensitivity method scalably computes gradients of solutions ...

19 Xuechen Li, et al. ∙

research

∙ 12/08/2019

Neural Networks with Cheap Differential Operators

Gradients of neural networks can be computed efficiently for any archite...

36 Ricky T. Q. Chen, et al. ∙

research

∙ 12/06/2019

Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One

We propose to reinterpret a standard discriminative classifier of p(y|x)...

19 Will Grathwohl, et al. ∙

research

∙ 11/06/2019

Optimizing Millions of Hyperparameters by Implicit Differentiation

We propose an algorithm for inexpensive gradient-based hyperparameter op...

43 Jonathan Lorraine, et al. ∙

research

∙ 10/02/2019

Efficient Graph Generation with Graph Recurrent Attention Networks

We propose a new family of efficient and expressive deep generative mode...

16 Renjie Liao, et al. ∙

research

∙ 08/18/2019

Understanding Undesirable Word Embedding Associations

Word embeddings are often criticized for capturing undesirable word asso...

0 Kawin Ethayarajh, et al. ∙

research

∙ 07/08/2019

Latent ODEs for Irregularly-Sampled Time Series

Time series with non-uniform intervals occur in many applications, and a...

1 Yulia Rubanova, et al. ∙

research

∙ 06/06/2019

Residual Flows for Invertible Generative Modeling

Flow-based generative models parameterize probability distributions thro...

2 Ricky T. Q. Chen, et al. ∙

research

∙ 03/07/2019

Self-Tuning Networks: Bilevel Optimization of Hyperparameters using Structured Best-Response Functions

Hyperparameter optimization can be formulated as a bilevel optimization ...

26 Matthew MacKay, et al. ∙

research

∙ 11/02/2018

Invertible Residual Networks

Reversible deep networks provide useful theoretical guarantees and have ...

16 Jens Behrmann, et al. ∙

research

∙ 10/11/2018

Towards Understanding Linear Word Analogies

A surprising property of word vectors is that vector algebra can often b...

0 Kawin Ethayarajh, et al. ∙

research

∙ 10/02/2018

FFJORD: Free-form Continuous Dynamics for Scalable Reversible Generative Models

A promising class of generative models maps points from a simple distrib...

6 Will Grathwohl, et al. ∙

research

∙ 08/20/2018

Stochastic Combinatorial Ensembles for Defending Against Adversarial Examples

Many deep learning algorithms can be easily fooled with simple adversari...

0 George A. Adam, et al. ∙

research

∙ 07/20/2018

Explaining Image Classifiers by Adaptive Dropout and Generative In-filling

Explanations of black-box classifiers often rely on saliency maps, which...

0 Chun-Hao Chang, et al. ∙

research

∙ 07/05/2018

Scalable Recommender Systems through Recursive Evidence Chains

Recommender systems can be formulated as a matrix completion problem, pr...

6 Elias Tragas, et al. ∙

research

∙ 06/19/2018

Neural Ordinary Differential Equations

We introduce a new family of deep neural network models. Instead of spec...

11 Tian Qi Chen, et al. ∙

research

∙ 02/26/2018

Stochastic Hyperparameter Optimization through Hypernetworks

Machine learning models are often tuned by nesting optimization of model...

1 Jonathan Lorraine, et al. ∙

research

∙ 02/14/2018

Isolating Sources of Disentanglement in Variational Autoencoders

We decompose the evidence lower bound to show the existence of a term me...

0 Tian Qi Chen, et al. ∙

research

∙ 01/10/2018

Inference Suboptimality in Variational Autoencoders

Amortized inference has led to efficient approximate inference for large...

0 Chris Cremer, et al. ∙

research

∙ 12/17/2017

Generating and designing DNA with deep generative models

We propose generative neural network methods to generate DNA sequences a...

0 Nathan Killoran, et al. ∙

research

∙ 12/06/2017

Noisy Natural Gradient as Variational Inference

Combining the flexibility of deep learning with Bayesian uncertainty est...

0 Guodong Zhang, et al. ∙

research

∙ 04/10/2017

Reinterpreting Importance-Weighted Autoencoders

The standard interpretation of importance-weighted autoencoders is that ...

0 Chris Cremer, et al. ∙

research

∙ 03/27/2017

Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference

We propose a simple and general variant of the standard reparameterized ...

0 Geoffrey Roeder, et al. ∙

research

∙ 08/22/2016

Neural networks for the prediction organic chemistry reactions

Reaction prediction remains one of the major challenges for organic chem...

0 Jennifer N. Wei, et al. ∙

research

∙ 03/20/2016

Composing graphical models with neural networks for structured representations and fast inference

We propose a general modeling and inference framework that composes prob...

0 Matthew J. Johnson, et al. ∙

research

∙ 09/30/2015

Convolutional Networks on Graphs for Learning Molecular Fingerprints

We introduce a convolutional neural network that operates directly on gr...

0 David Duvenaud, et al. ∙

research

∙ 04/06/2015

Early Stopping is Nonparametric Variational Inference

We show that unconverged stochastic gradient descent can be interpreted ...

0 Dougal Maclaurin, et al. ∙

research

∙ 02/11/2015

Gradient-based Hyperparameter Optimization through Reversible Learning

Tuning hyperparameters of learning algorithms is hard because gradients ...

0 Dougal Maclaurin, et al. ∙

research

∙ 08/09/2014

Warped Mixtures for Nonparametric Cluster Shapes

A mixture of Gaussians fit to a single curved or heavy-tailed cluster wi...

0 Tomoharu Iwata, et al. ∙

research

∙ 08/09/2014

Optimally-Weighted Herding is Bayesian Quadrature

Herding and kernel herding are deterministic methods of choosing samples...

0 Ferenc Huszár, et al. ∙

research

∙ 02/24/2014

Avoiding pathologies in very deep networks

Choosing appropriate architectures and regularization strategies for dee...

0 David Duvenaud, et al. ∙

research

∙ 02/18/2014

Automatic Construction and Natural-Language Description of Nonparametric Regression Models

This paper presents the beginnings of an automatic statistician, focusin...

0 James Robert Lloyd, et al. ∙

research

∙ 02/20/2013

Structure Discovery in Nonparametric Regression through Compositional Kernel Search

Despite its importance, choosing the structural form of the kernel in no...

0 David Duvenaud, et al. ∙

research

∙ 12/19/2011

Additive Gaussian Processes

We introduce a Gaussian process model of functions which are additive. A...

0 David Duvenaud, et al. ∙

David Duvenaud

Featured Co-authors

Sign in with Google

Consider DeepAI Pro