-
Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition
This work studies the problem of learning episodic Markov Decision Proce...
read it
-
Learning Adversarial MDPs with Bandit Feedback and Unknown Transition
We consider the problem of learning in episodic finite-horizon Markov de...
read it
-
Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem
Order dispatching and driver repositioning (also known as fleet manageme...
read it

Tiancheng Jin
is this you? claim profile