We investigate the approximation efficiency of score functions by deep n...
Attention layers – which map a sequence of inputs to a sequence of outpu...
Neural sequence models based on the transformer architecture have
demons...
This paper studies the fundamental limits of reinforcement learning (RL)...
In recent decades, the emergence of social networks has enabled internet...
In high dimensional variable selection problems, statisticians often see...
Partial Observability – where agents can only observe partial informatio...
Finding unified complexity measures and algorithms for sample-efficient
...
A conceptually appealing approach for learning Extensive-Form Games (EFG...
Imperfect-Information Extensive-Form Games (IIEFGs) is a prevalent model...
The Quantum Approximate Optimization Algorithm (QAOA) is a general purpo...
Quantifying the data uncertainty in learning tasks is often done by lear...
This paper resolves the open question of designing near-optimal algorith...
Recent empirical work has shown that hierarchical convolutional kernels
...
To understand how deep learning works, it is crucial to understand the
t...
Multi-agent reinforcement learning has made substantial empirical progre...
We study mean-field variational Bayesian inference using the TAP approac...
Estimating the data uncertainty in regression tasks is often done by lea...
Recent work showed that there could be a large gap between the classical...
A number of machine learning tasks entail a high degree of invariance: t...
Modern machine learning models with high accuracy are often miscalibrate...
Consider the classical supervised learning problem: we are given data
(y...
For a certain scaling of the initialization of stochastic gradient desce...
Deep learning methods operate in regimes that defy the traditional
stati...
We study the supervised learning problem under either of the following t...
We consider the problem of learning an unknown function f_ on the
d-dime...
We study a family of (potentially non-convex) constrained optimization
p...
We consider learning two layer neural networks using stochastic gradient...
We consider the Sherrington-Kirkpatrick model of spin glasses with
ferro...
Multi-layer neural networks are among the most powerful models in machin...
We consider the problem of estimating a large rank-one tensor
u^⊗ k∈( R...
A number of statistical estimation problems can be addressed by semidefi...
Most high-dimensional estimation and prediction methods propose to minim...