Pavel Izmailov

research

∙ 06/19/2023

Simple and Fast Group Robustness by Automatic Feature Reweighting

A major challenge to out-of-distribution generalization is reliance on s...

0 Shikai Qiu, et al. ∙

research

∙ 12/15/2022

FlexiViT: One Model for All Patch Sizes

Vision Transformers convert images to sequences by slicing them into pat...

15 Lucas Beyer, et al. ∙

research

∙ 10/20/2022

On Feature Learning in the Presence of Spurious Correlations

Deep classifiers are known to rely on spurious features x2013 patterns w...

0 Pavel Izmailov, et al. ∙

research

∙ 04/06/2022

Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations

Neural network classifiers can largely rely on simple spurious features,...

0 Polina Kirichenko, et al. ∙

research

∙ 03/30/2022

On Uncertainty, Tempering, and Data Augmentation in Bayesian Classification

Aleatoric uncertainty captures the inherent randomness of the data, such...

0 Sanyam Kapoor, et al. ∙

research

∙ 02/23/2022

Bayesian Model Selection, the Marginal Likelihood, and Generalization

How do we compare between hypotheses that are entirely consistent with o...

0 Sanae Lotfi, et al. ∙

research

∙ 06/22/2021

Dangers of Bayesian Model Averaging under Covariate Shift

Approximate Bayesian inference for neural networks is considered a robus...

0 Pavel Izmailov, et al. ∙

research

∙ 06/10/2021

Does Knowledge Distillation Really Work?

Knowledge distillation is a popular technique for training a small stude...

0 Samuel Stanton, et al. ∙

research

∙ 04/29/2021

What Are Bayesian Neural Network Posteriors Really Like?

The posterior over Bayesian neural network (BNN) parameters is extremely...

0 Pavel Izmailov, et al. ∙

research

∙ 10/22/2020

Learning Invariances in Neural Networks

Invariances to translations have imbued convolutional neural networks wi...

0 Gregory Benton, et al. ∙

research

∙ 06/15/2020

Why Normalizing Flows Fail to Detect Out-of-Distribution Data

Detecting out-of-distribution (OOD) data is crucial for robust machine l...

13 Polina Kirichenko, et al. ∙

research

∙ 02/25/2020

Generalizing Convolutional Neural Networks for Equivariance to Lie Groups on Arbitrary Continuous Data

The translation equivariance of convolutional layers enables convolution...

11 Marc Finzi, et al. ∙

research

∙ 02/20/2020

Bayesian Deep Learning and a Probabilistic Perspective of Generalization

The key distinguishing property of a Bayesian approach is marginalizatio...

41 Andrew Gordon Wilson, et al. ∙

research

∙ 12/30/2019

Semi-Supervised Learning with Normalizing Flows

Normalizing flows transform a latent distribution through an invertible ...

13 Pavel Izmailov, et al. ∙

research

∙ 07/17/2019

Subspace Inference for Bayesian Deep Learning

Bayesian inference was once a gold standard for learning with neural net...

3 Pavel Izmailov, et al. ∙

research

∙ 02/07/2019

A Simple Baseline for Bayesian Uncertainty in Deep Learning

We propose SWA-Gaussian (SWAG), a simple, scalable, and general purpose ...

20 Wesley Maddox, et al. ∙

research

∙ 06/14/2018

Improving Consistency-Based Semi-Supervised Learning with Weight Averaging

Recent advances in deep unsupervised learning have renewed interest in s...

0 Ben Athiwaratkun, et al. ∙

research

∙ 03/14/2018

Averaging Weights Leads to Wider Optima and Better Generalization

Deep neural networks are typically trained by optimizing a loss function...

0 Pavel Izmailov, et al. ∙

research

∙ 02/27/2018

Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

The loss functions of deep neural networks are complex and their geometr...

0 Timur Garipov, et al. ∙

research

∙ 01/05/2018

Tensor Train decomposition on TensorFlow (T3F)

Tensor Train decomposition is used across many branches of machine learn...

0 Alexander Novikov, et al. ∙

research

∙ 10/19/2017

Scalable Gaussian Processes with Billions of Inducing Inputs via Tensor Train Decomposition

We propose a method (TT-GP) for approximate inference in Gaussian Proces...

0 Pavel Izmailov, et al. ∙

research

∙ 11/18/2016

Faster variational inducing input Gaussian process classification

Gaussian processes (GP) provide a prior over functions and allow finding...

0 Pavel Izmailov, et al. ∙

Pavel Izmailov

Featured Co-authors

Sign in with Google

Consider DeepAI Pro