
Deep learning: a statistical viewpoint
The remarkable practical success of deep learning has revealed some majo...
On the Minimal Error of Empirical Risk Minimization
We study the minimal error of the Empirical Risk Minimization (ERM) proc...
Top-k eXtreme Contextual Bandits with Arm Hierarchy
Motivated by modern applications, such as online advertisement and recom...
Learning the Linear Quadratic Regulator from Nonlinear Observations
We introduce a new problem setting for continuous control called the LQR...
Instance-Dependent Complexity of Contextual Bandits and Reinforcement Learning: A Disagreement-Based Perspective
In the classical multi-armed bandit problem, instance-dependent algorith...
Fast Mixing of Multi-Scale Langevin Dynamics under the Manifold Hypothesis
Recently, the task of image generation has attracted much attention. In ...
On Suboptimality of Least Squares with Application to Estimation of Convex Bodies
We develop a technique for establishing lower bounds on the sample compl...
Learning nonlinear dynamical systems from a single trajectory
We introduce algorithms for learning nonlinear dynamical systems of the ...
Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles
A fundamental challenge in contextual bandits is to develop flexible, ge...
Generative Modeling with Denoising Auto-Encoders and Langevin Sampling
We study convergence of a generative modeling method that first estimate...
ℓ_∞ Vector Contraction for Rademacher Complexity
We show that the Rademacher complexity of any R^K-valued function class ...
On the Risk of Minimum-Norm Interpolants and Restricted Lower Isometry of Kernels
We study the risk of minimum-norm interpolants of data in a Reproducing ...
Breast Tumor Cellularity Assessment using Deep Neural Networks
Breast cancer is one of the main causes of death worldwide. Histopatholo...
Consistency of Interpolation with Laplace Kernels is a High-Dimensional Phenomenon
We show that minimum-norm interpolation in the Reproducing Kernel Hilber...
Just Interpolate: Kernel "Ridgeless" Regression Can Generalize
In the absence of explicit regularization, Kernel "Ridgeless" Regression...
Does data interpolation contradict statistical optimality?
We show that learning methods interpolating the training data can achiev...
Angiodysplasia Detection and Localization Using Deep Convolutional Neural Networks
Accurate detection and localization for angiodysplasia lesions is an imp...
Online Learning: Sufficient Statistics and the Burkholder Method
We uncover a fairly general principle in online learning: If regret can ...
Automatic Instrument Segmentation in Robot-Assisted Surgery Using Deep Learning
Semantic segmentation of robotic instruments is an important problem for...
Deep Convolutional Neural Networks for Breast Cancer Histology Image Analysis
Breast cancer is one of the main causes of cancer death worldwide. Early...
Theory of Deep Learning IIb: Optimization Properties of SGD
In Theory IIb we characterize with a mix of theory and experiments the o...
Size-Independent Sample Complexity of Neural Networks
We study the sample complexity of learning neural networks, by providing...
Pediatric Bone Age Assessment Using Deep Convolutional Neural Networks
Skeletal bone age assessment is a common clinical practice to diagnose e...
Fisher-Rao Metric, Geometry, and Complexity of Neural Networks
We study the relationship between geometry and capacity measures for dee...
Weighted Message Passing and Minimum Energy Flow for Heterogeneous Stochastic Block Models with Side Information
We study the misclassification error for community detection in general ...
ZigZag: A new approach to adaptive online learning
We develop a novel family of algorithms for the online learning setting ...
Non-convex learning via Stochastic Gradient Langevin Dynamics: a nonasymptotic analysis
Stochastic Gradient Langevin Dynamics (SGLD) is a popular variant of Sto...
A Tutorial on Online Supervised Learning with Applications to Node Classification in Social Networks
We revisit the elegant observation of T. Cover '65 which, perhaps, is no...
Inference via Message Passing on Partially Labeled Stochastic Block Models
We study the community detection and recovery problem in partially-label...
BISTRO: An Efficient Relaxation-Based Method for Contextual Bandits
We present efficient algorithms for the problem of contextual bandits wi...
On Equivalence of Martingale Tail Bounds and Deterministic Regret Inequalities
We study an equivalence of (i) deterministic pathwise statements appeari...
Adaptive Online Learning
We propose a general framework for studying adaptive regret bounds in th...
Hierarchies of Relaxations for Online Prediction Problems with Evolving Constraints
We study online prediction where regret of the algorithm is measured aga...
Learning with Square Loss: Localization through Offset Rademacher Complexity
We consider regression with square loss and general classes of functions...
Computational and Statistical Boundaries for Submatrix Localization in a Large Noisy Matrix
The interplay between computational efficiency and statistical accuracy ...
Sequential Probability Assignment with Binary Alphabets and Large Classes of Experts
We analyze the problem of sequential probability assignment for binary o...
Online Nonparametric Regression with General Loss Functions
This paper establishes minimax rates for online regression with arbitrar...
Online Optimization: Competing with Dynamic Comparators
Recent literature on online learning has focused on developing adaptive ...
Distributed Detection: Finite-time Analysis and Impact of Network Topology
This paper addresses the problem of distributed detection in multi-agent...
Geometric Inference for General High-Dimensional Linear Inverse Problems
This paper presents a unified geometric framework for the statistical an...
On Zeroth-Order Stochastic Convex Optimization via Random Walks
We propose a method for zeroth order stochastic convex optimization that...
Online Nonparametric Regression
We establish optimal rates for online regression for arbitrary classes o...
Online Learning of Dynamic Parameters in Social Networks
This paper addresses the problem of online learning in a dynamic setting...
Efficient Sampling from Time-Varying Log-Concave Distributions
We propose a computationally efficient random walk on a convex body whic...
Competing With Strategies
We study the problem of online learning with a notion of regret defined ...
Online Learning with Predictable Sequences
We present methods for online linear optimization that take advantage of...
Relax and Localize: From Value to Algorithms
We show a principled way of deriving online learning algorithms from a m...
Online Learning: Stochastic and Constrained Adversaries
Learning theory has largely focused on two main learning scenarios. The ...
Online Learning: Beyond Regret
We study online learnability of a wide class of problems, extending the ...
