Learning to control an agent from data collected offline in a rich
pixel...
We consider a multi-armed bandit problem with M latent contexts, where a...
We consider episodic reinforcement learning in reward-mixing Markov deci...
A person walking along a city street who tries to model all aspects of t...
In real-world reinforcement learning applications the learner's observat...
Real-world sequential decision making problems commonly involve partial
...
Motivated by online recommendation systems, we propose the problem of fi...
Many real-world applications of reinforcement learning (RL) require the ...
A fundamental concept in control theory is that of controllability, wher...
We consider a stochastic multi-armed bandit setting where feedback is li...
Learning a near optimal policy in a partially observable system remains ...
We study the Stochastic Shortest Path (SSP) problem in which an agent ha...
In this work, we consider the regret minimization problem for reinforcem...
The computational model of reinforcement learning is based upon the abil...
We study linear contextual bandits with access to a large, partially
obs...
We propose deep Reinforcement Learning (RL) algorithms inspired by mirro...
In many sequential decision-making problems, the goal is to optimize a
u...
Policy optimization methods are one of the most widely used classes of
R...
Multi-step greedy policies have been extensively used in model-based
Rei...
Real Time Dynamic Programming (RTDP) is a well-known Dynamic Programming...
Trust region policy optimization (TRPO) is a popular and empirically
suc...
State-of-the-art efficient model-based Reinforcement Learning (RL) algor...
A policy is said to be robust if it maximizes the reward while consideri...
The objective of Reinforcement Learning is to learn an optimal policy by...
Finite-horizon lookahead policies are abundantly used in Reinforcement
L...
Multiple-step lookahead policies have demonstrated high empirical compet...
The famous Policy Iteration algorithm alternates between policy improvem...