Motivated by the prevalence of “price protection guarantee", which allow...
The rise of big data analytics has automated the decision-making of comp...
Motivated by practical considerations in machine learning for financial
...
Motivated by practical needs in online experimentation and off-policy
le...
We consider model-free reinforcement learning (RL) in non-stationary Mar...
We consider un-discounted reinforcement learning (RL) in Markov decision...
We propose algorithms with state-of-the-art dynamic regret bounds for
un...
We introduce general data-driven decision-making algorithms that achieve...
We study the problem of learning across a sequence of price
experiments ...
We introduce efficient algorithms which achieve nearly optimal regrets f...
We introduce algorithms that achieve state-of-the-art dynamic regret
bou...