Classical theory in reinforcement learning (RL) predominantly focuses on...
Data-driven algorithms can adapt their internal structure or parameters ...
Offline policy evaluation is a fundamental statistical problem in
reinfo...
We study the problem of information sharing and cooperation in Multi-Pla...
Standard approaches to decision-making under uncertainty focus on sequen...
We study the problem of adaptive control of the linear quadratic regulat...
Since its introduction a decade ago, relative entropy policy search
(REP...
We propose a simple model selection approach for algorithms in stochasti...
Maximum a posteriori (MAP) inference in discrete-valued Markov random fi...
We study a constrained contextual linear bandit setting, where the goal ...
We investigate the capacity control provided by dropout in various machi...
We show that diffusion processes can be exploited to study the posterior...
We formulate gradient-based Markov chain Monte Carlo (MCMC) sampling as
...
Many machine learning models are vulnerable to adversarial attacks. It h...
In this paper, we study robust large-scale distributed learning in the
p...
In large-scale distributed learning, security issues have become increas...
This paper presents a margin-based multiclass generalization bound for n...
Langevin diffusion is a commonly used tool for sampling from a given
dis...
We study online learning under logarithmic loss with regular parametric
...
Advice-efficient prediction with expert advice (in analogy to label-effi...
We introduce new online and batch algorithms that are robust to data wit...