Modified policy iteration (MPI) also known as optimistic policy iteratio...
Many policy-based reinforcement learning (RL) algorithms can be viewed a...
We study learning-based admission control for a classical Erlang-B block...
We introduce and study a group formation game in which individuals/agent...
We study the problem of recovering a planted matching in randomly weight...