Ruihao Zhu

research

∙ 11/03/2022

Phase Transitions in Learning and Earning under Price Protection Guarantee

Motivated by the prevalence of “price protection guarantee”, which allow...

Qing Feng, et al.

research

∙ 11/02/2022

Learning to Price Supply Chain Contracts against a Learning Retailer

The rise of big data analytics has automated the decision-making of comp...

Xuejun Zhao, et al.

research

∙ 08/04/2022

Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing

Motivated by practical considerations in machine learning for financial ...

Jingwei Ji, et al.

research

∙ 11/08/2021

Safe Optimal Design with Applications in Policy Learning

Motivated by practical needs in online experimentation and off-policy le...

Ruihao Zhu, et al.

research

∙ 10/07/2020

Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs

We consider model-free reinforcement learning (RL) in non-stationary Mar...

Weichao Mao, et al.

research

∙ 06/24/2020

Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism

We consider un-discounted reinforcement learning (RL) in Markov decision...

Wang Chi Cheung, et al.

research

∙ 06/07/2019

Reinforcement Learning under Drift

We propose algorithms with state-of-the-art dynamic regret bounds for un...

Wang Chi Cheung, et al.

research

∙ 03/04/2019

Hedging the Drift: Learning to Optimize under Non-Stationarity

We introduce general data-driven decision-making algorithms that achieve...

Wang Chi Cheung, et al.

research

∙ 02/28/2019

Meta Dynamic Pricing: Learning Across Experiments

We study the problem of learning across a sequence of price experiments ...

Hamsa Bastani, et al.

research

∙ 10/24/2018

Learning to Route Efficiently with End-to-End Feedback: The Value of Networked Structure

We introduce efficient algorithms which achieve nearly optimal regrets f...

Ruihao Zhu, et al.

research

∙ 10/06/2018

Learning to Optimize under Non-Stationarity

We introduce algorithms that achieve state-of-the-art dynamic regret bou...

Wang Chi Cheung, et al.
