Safe Reinforcement Learning (RL) aims to find a policy that achieves hig...
We consider the problem of computing a function of n variables using noi...
We revisit the problem of computing with noisy information considered in...
Reinforcement learning from human feedback (RLHF) has emerged as a relia...
Large Language Models (LLMs) and other large foundation models have achi...
Self-training is an important technique for solving semi-supervised lear...
The creator economy has revolutionized the way individuals can profit th...
We consider the sequential decision-making problem where the mean outcom...
We propose A-Crab (Actor-Critic Regularized by Average Bellman error), a...
We study the problem of online learning in a two-player decentralized
co...
We provide a theoretical framework for Reinforcement Learning with Human...
Offline reinforcement learning (RL), which refers to decision-making fro...
In the infinite-armed bandit problem, each arm's average reward is sampl...
Online imitation learning is the problem of how best to mimic expert
dem...
We propose Byzantine-robust federated learning protocols with nearly opt...
Reinforcement learning (RL) provides a theoretical framework for continu...
We provide a general framework for designing Generative Adversarial Netw...
Policy optimization methods are one of the most widely used classes of
R...
Many hierarchical reinforcement learning (RL) applications have empirica...
In online reinforcement learning (RL), efficient exploration remains
par...
Secure aggregation is a critical component in federated learning, which
...
Offline (or batch) reinforcement learning (RL) algorithms seek to learn ...
We study the statistical limits of Imitation Learning (IL) in episodic M...
We study the problem of off-policy evaluation in the multi-armed bandit ...
This paper introduces Fast Linearized Adaptive Policy (FLAP), a new
meta...
We present an efficient and practical (polynomial time) algorithm for on...
Imitation learning (IL) aims to mimic the behavior of an expert policy i...
We explore why many recently proposed robust estimation problems are
eff...
We analyze the performance of the Tukey median estimator under total
var...
Robust statistics traditionally focuses on outliers, or perturbations in...
A blockchain is a database of sequential events that is maintained by a
...
We deconstruct the performance of GANs into three components:
1. Formu...
We identify a trade-off between robustness and accuracy that serves as a...
We study the problem of alleviating the instability issue in the GAN tra...
We study concentration inequalities for the Kullback--Leibler (KL) diver...
For any Markov source, there exist universal codes whose normalized
code...
For any Markov source, there exist universal codes whose normalized
code...
We present Local Moment Matching (LMM), a unified methodology for
symmet...
Estimating the entropy based on data is one of the prototypical problems...
We propose an efficient algorithm for approximate computation of the pro...
We analyze the Kozachenko--Leonenko (KL) nearest neighbor estimator for ...
We consider the problem of minimax estimation of the entropy of a densit...
We show through case studies that it is easier to estimate the fundament...
The Residual Network (ResNet), proposed in He et al. (2015), utilized
sh...
Maximum likelihood is the most widely used statistical estimation techni...
In many signal detection and classification problems, we have knowledge ...