Shikai Luo

research

∙ 12/29/2022

An Instrumental Variable Approach to Confounded Off-Policy Evaluation

Off-policy evaluation (OPE) is a method for estimating the return of a t...

Yang Xu, et al.

research

∙ 12/29/2022

Quantile Off-Policy Evaluation via Deep Conditional Generative Learning

Off-policy evaluation (OPE) is concerned with evaluating a new target po...

Yang Xu, et al.

research

∙ 06/14/2022

Conformal Off-Policy Prediction

Off-policy evaluation is critical in a number of applications where new ...

Yingying Zhang, et al.

research

∙ 02/26/2022

Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons

We consider reinforcement learning (RL) methods in offline domains witho...

Chengchun Shi, et al.

research

∙ 02/22/2022

Policy Evaluation for Temporal and/or Spatial Dependent Experiments in Ride-sourcing Platforms

Policy evaluation based on A/B testing has attracted considerable intere...

Shikai Luo, et al.

research

∙ 02/22/2022

Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process

This paper is concerned with constructing a confidence interval for a ta...

Chengchun Shi, et al.

research

∙ 02/21/2022

A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets

The two-sided markets such as ride-sharing companies often involve a gro...

Chengchun Shi, et al.

research

∙ 11/06/2021

An Online Sequential Test for Qualitative Treatment Effects

Tech companies (e.g., Google or Facebook) often use randomized online ex...

Chengchun Shi, et al.

research

∙ 05/27/2021

Pattern Transfer Learning for Reinforcement Learning in Order Dispatching

Order dispatch is one of the central problems to ride-sharing platforms....

Runzhe Wan, et al.

research

∙ 02/05/2020

A Reinforcement Learning Framework for Time-Dependent Causal Effects Evaluation in A/B Testing

A/B testing, or online experiment is a standard business strategy to com...

Chengchun Shi, et al.

research

∙ 07/29/2014

Sure Screening for Gaussian Graphical Models

We propose graphical sure screening, or GRASS, a very simple and computa...

Shikai Luo, et al.
