
CAD: Debiasing the Lasso with inaccurate covariate model
We consider the problem of estimating a lowdimensional parameter in hig...
read it

Streaming Belief Propagation for Community Detection
The community detection problem requires to cluster the nodes of a netwo...
read it

Minimum complexity interpolation in random features models
Despite their many appealing properties, kernel methods are heavily affe...
read it

Deep learning: a statistical viewpoint
The remarkable practical success of deep learning has revealed some majo...
read it

Learning with invariances in random features and kernel models
A number of machine learning tasks entail a high degree of invariance: t...
read it

Generalization error of random features and kernel methods: hypercontractivity and kernel matrix concentration
Consider the classical supervised learning problem: we are given data (y...
read it

Underspecification Presents Challenges for Credibility in Modern Machine Learning
ML models often exhibit unexpectedly poor behavior when they are deploye...
read it

The Lasso with general Gaussian designs with applications to hypothesis testing
The Lasso is a method for highdimensional regression, which is now comm...
read it

The Interpolation Phase Transition in Neural Networks: Memorization and Generalization under Lazy Training
Modern neural networks are often operated in a strongly overparametrized...
read it

When Do Neural Networks Outperform Kernel Methods?
For a certain scaling of the initialization of stochastic gradient desce...
read it

The estimation error of general first order methods
Modern largescale statistical models require to estimate thousands to m...
read it

Imputation for HighDimensional Linear Regression
We study highdimensional regression with missing entries in the covaria...
read it

The generalization error of maxmargin linear classifiers: Highdimensional asymptotics in the overparametrized regime
Modern machine learning models are often so complex that they achieve va...
read it

The generalization error of random features regression: Precise asymptotics and double descent curve
Deep learning methods operate in regimes that defy the traditional stati...
read it

Limitations of Lazy Training of Twolayers Neural Networks
We study the supervised learning problem under either of the following t...
read it

Linearized twolayers neural networks in high dimension
We consider the problem of learning an unknown function f_ on the ddime...
read it

On the computational tractability of statistical estimation on amenable graphs
We consider the problem of estimating a vector of discrete variables (θ_...
read it

Fundamental Barriers to HighDimensional Regression with Convex Penalties
In highdimensional regression, we attempt to estimate a parameter vecto...
read it

Surprises in HighDimensional Ridgeless Least Squares Interpolation
Interpolators  estimators that achieve zero training error  have att...
read it

Meanfield theory of twolayers neural networks: dimensionfree bounds and kernel limit
We consider learning two layer neural networks using stochastic gradient...
read it

Analysis of a TwoLayer Neural Network via Displacement Convexity
Fitting a function by using linear combinations of a large number N of `...
read it

The distribution of the Lasso: Uniform control over sparse balls and adaptive parameter tuning
The Lasso is a popular regression method for highdimensional problems i...
read it

Adapting to Unknown Noise Distribution in Matrix Denoising
We consider the problem of estimating an unknown matrix ∈^m× n, from obs...
read it

TAP free energy, spin glasses, and variational inference
We consider the SherringtonKirkpatrick model of spin glasses with ferro...
read it

Contextual Stochastic Block Models
We provide the first information theoretic tight analysis for inference ...
read it

A Mean Field View of the Landscape of TwoLayers Neural Networks
Multilayer neural networks are among the most powerful models in machin...
read it

The threshold for SDPrefutation of random regular NAE3SAT
Unlike its cousin 3SAT, the NAE3SAT (notallequal3SAT) problem has th...
read it

On the Connection Between Learning TwoLayers Neural Networks and Tensor Decomposition
We establish connections between the problem of learning a twolayers ne...
read it

An Instability in Variational Inference for Topic Models
Topic models are Bayesian models that are frequently used to capture the...
read it

The landscape of the spiked tensor model
We consider the problem of estimating a large rankone tensor u^⊗ k∈( R...
read it

Estimation of LowRank Matrices via Approximate Message Passing
Consider the problem of estimating a lowrank symmetric matrix when its ...
read it

Inference in Graphical Models via Semidefinite Programming Hierarchies
Maximum A posteriori Probability (MAP) inference in graphical models amo...
read it

Learning Combinations of Sigmoids Through Gradient Estimation
We develop a new approach to learn the parameters of regression models w...
read it

Fundamental Limits of Weak Recovery with Applications to Phase Retrieval
In phase retrieval we want to recover an unknown signal x∈ C^d from n q...
read it

Nonnegative Matrix Factorization via Archetypal Analysis
Given a collection of data points, nonnegative matrix factorization (NM...
read it

Solving SDPs for synchronization and MaxCut problems via the Grothendieck inequality
A number of statistical estimation problems can be addressed by semidefi...
read it

Spectral algorithms for tensor completion
In the tensor completion problem, one seeks to estimate a lowrank tenso...
read it

How Well Do Local Algorithms Solve Semidefinite Programs?
Several probabilistic models from highdimensional statistics and machin...
read it

The Landscape of Empirical Risk for Nonconvex Losses
Most highdimensional estimation and prediction methods propose to minim...
read it

Performance of a community detection algorithm based on semidefinite programming
The problem of detecting communities in a graph is maybe one the most st...
read it

Online Rules for Control of False Discovery Rate and False Discovery Exceedance
Multiple hypothesis testing is a core problem in statistical inference a...
read it

A Grothendiecktype inequality for local maxima
A large number of problems in optimization, machine learning, signal pro...
read it

Convergence rates of subsampled Newton methods
We consider the problem of minimizing a sum of n functions over a convex...
read it

Debiasing the Lasso: Optimal Sample Size for Gaussian Designs
Performing statistical inference in highdimension is an outstanding cha...
read it

Improved SumofSquares Lower Bounds for Hidden Clique and Hidden Submatrix Problems
Given a large data matrix A∈R^n× n, we consider the problem of determini...
read it

Finding One Community in a Sparse Graph
We consider a random sparse graph with bounded average degree, in which ...
read it

A statistical model for tensor PCA
We consider the Principal Component Analysis problem for large tensors o...
read it

Statistical Estimation: From Denoising to Sparse Regression and Hidden Cliques
These notes review six lectures given by Prof. Andrea Montanari on the t...
read it

Guess Who Rated This Movie: Identifying Users Through Subspace Clustering
It is often the case that, within an online recommender system, multiple...
read it

Sparse PCA via Covariance Thresholding
In sparse principal component analysis we are given noisy observations o...
read it