A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets

02/21/2022
by   Chengchun Shi, et al.
0

The two-sided markets such as ride-sharing companies often involve a group of subjects who are making sequential decisions across time and/or location. With the rapid development of smart phones and internet of things, they have substantially transformed the transportation landscape of human beings. In this paper we consider large-scale fleet management in ride-sharing companies that involve multiple units in different areas receiving sequences of products (or treatments) over time. Major technical challenges, such as policy evaluation, arise in those studies because (i) spatial and temporal proximities induce interference between locations and times; and (ii) the large number of locations results in the curse of dimensionality. To address both challenges simultaneously, we introduce a multi-agent reinforcement learning (MARL) framework for carrying policy evaluation in these studies. We propose novel estimators for mean outcomes under different products that are consistent despite the high-dimensionality of state-action space. The proposed estimator works favorably in simulation experiments. We further illustrate our method using a real dataset obtained from a two-sided marketplace company to evaluate the effects of applying different subsidizing policies. A Python implementation of the proposed method is available at https://github.com/RunzheStat/CausalMARL.

READ FULL TEXT

page 15

page 17

research
02/18/2018

Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning

Large-scale online ride-sharing platforms have substantially transformed...
research
07/06/2020

Towards Efficient Connected and Automated Driving System via Multi-agent Graph Reinforcement Learning

Connected and automated vehicles (CAVs) have attracted more and more att...
research
03/02/2023

Parameter Sharing with Network Pruning for Scalable Multi-Agent Deep Reinforcement Learning

Handling the problem of scalability is one of the essential issues for m...
research
02/05/2020

A Reinforcement Learning Framework for Time-Dependent Causal Effects Evaluation in A/B Testing

A/B testing, or online experiment is a standard business strategy to com...
research
12/22/2022

Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning

Cooperative multi-agent reinforcement learning (c-MARL) is widely applie...
research
08/14/2023

ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate

Text evaluation has historically posed significant challenges, often dem...

Please sign up or login with your details

Forgot password? Click here to reset