While numerous works have focused on devising efficient algorithms for
r...
In online reinforcement learning (online RL), balancing exploration and
...
While quantum reinforcement learning (RL) has attracted a surge of atten...
We study stochastic delayed feedback in general multi-agent sequential
d...
Sim-to-real transfer trains RL agents in the simulated environments and ...
It is well-known that modern neural networks are vulnerable to adversari...
We study human-in-the-loop reinforcement learning (RL) with trajectory
p...
We study multi-player general-sum Markov games with one of the players
d...
Policy optimization methods are one of the most widely used classes of
R...
Despite a large amount of effort in dealing with heavy-tailed error in
m...
We study episodic reinforcement learning (RL) in non-stationary linear k...
We study bandits and reinforcement learning (RL) subject to a conservati...
While deep reinforcement learning has achieved tremendous successes in
v...