Reward Design for Driver Repositioning Using Multi-Agent Reinforcement Learning

02/17/2020
by   Zhenyu Shou, et al.

A large portion of passenger requests reportedly goes unserved, partially due to the cruising behavior of vacant for-hire drivers during the passenger-seeking process. This paper models the multi-driver repositioning task through a mean field multi-agent reinforcement learning (MARL) approach. Because directly applying MARL to the multi-driver system under a given reward mechanism will very likely yield a suboptimal equilibrium due to the selfishness of drivers, this study proposes a reward design scheme with which a more desirable equilibrium can be reached. To effectively solve the resulting bilevel optimization problem, with reward design as the upper level and a multi-agent system (MAS) as the lower level, a Bayesian optimization algorithm is adopted to speed up the learning process. We then test the proposed model on a synthetic dataset. The results show that the weighted average of order response rate and overall service charge can be improved by 4% using a simple platform service charge, compared with that of no reward design.
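
The bilevel structure described in the abstract can be illustrated with a minimal toy sketch. Everything in it is an assumption made for illustration and is not taken from the paper: a two-zone city, the fares and demand figures, the function names, best-response dynamics standing in for the mean field MARL lower level, and a plain grid search standing in for Bayesian optimization at the upper level. The upper-level objective here is the order response rate alone, which is simpler than the weighted objective the paper uses.

```python
# Toy bilevel reward design (illustrative assumptions throughout).
# Lower level: selfish drivers choose between two zones until no one gains by moving.
# Upper level: the platform picks a service charge on zone A.

FARES = (10.0, 6.0)   # hypothetical fare per order in zones A and B
DEMAND = (40, 60)     # hypothetical number of orders in zones A and B
N_DRIVERS = 100       # hypothetical fleet size


def payoff(net_fare, demand, drivers):
    """Expected earning of one driver when `drivers` compete for `demand` orders."""
    if drivers == 0:
        return 0.0
    return net_fare * min(1.0, demand / drivers)


def lower_level_equilibrium(charge_a):
    """Best-response dynamics: returns the number of drivers in zone A at equilibrium."""
    net_a, net_b = FARES[0] - charge_a, FARES[1]
    n_a = N_DRIVERS // 2
    while True:
        n_b = N_DRIVERS - n_a
        stay_a = payoff(net_a, DEMAND[0], n_a)
        move_b = payoff(net_b, DEMAND[1], n_b + 1)
        stay_b = payoff(net_b, DEMAND[1], n_b)
        move_a = payoff(net_a, DEMAND[0], n_a + 1)
        if n_a > 0 and move_b > stay_a:
            n_a -= 1          # a driver in A strictly gains by moving to B
        elif n_b > 0 and move_a > stay_b:
            n_a += 1          # a driver in B strictly gains by moving to A
        else:
            return n_a        # no profitable unilateral move remains


def order_response_rate(charge_a):
    """Upper-level objective: share of orders served at the induced equilibrium."""
    n_a = lower_level_equilibrium(charge_a)
    served = min(n_a, DEMAND[0]) + min(N_DRIVERS - n_a, DEMAND[1])
    return served / sum(DEMAND)


def design_reward(candidates):
    """Grid search over candidate charges (a stand-in for Bayesian optimization)."""
    return max(candidates, key=order_response_rate)


if __name__ == "__main__":
    grid = [0.5 * i for i in range(11)]   # charges 0.0, 0.5, ..., 5.0
    best = design_reward(grid)
    print("no charge:", order_response_rate(0.0))
    print("best charge:", best, "rate:", order_response_rate(best))
```

In this toy setting, drivers crowd into the higher-fare zone when no charge is applied, leaving orders in the other zone unserved; a suitable charge on the crowded zone shifts the equilibrium toward the demand split. Each upper-level evaluation requires solving the lower level to equilibrium, which is why a sample-efficient search such as Bayesian optimization becomes attractive when the lower level is an expensive MARL training run.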

research
05/23/2018

Game of Coins

We formalize the current practice of strategic mining in multi-cryptocur...
research
01/31/2019

Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning

A fundamental question in any peer-to-peer ridesharing system is how to,...
research
02/13/2021

Equilibrium Inverse Reinforcement Learning for Ride-hailing Vehicle Network

Ubiquitous mobile computing have enabled ride-hailing services to collec...
research
10/16/2012

Toward Large-Scale Agent Guidance in an Urban Taxi Service

Empty taxi cruising represents a wastage of resources in the context of ...
research
10/14/2022

Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning

Multi-agent reinforcement learning has drawn increasing attention in pra...
research
12/02/2022

Multi-Agent Reinforcement Learning with Reward Delays

This paper considers multi-agent reinforcement learning (MARL) where the...
research
11/22/2020

Multi-Agent Reinforcement Learning for Dynamic Routing Games: A Unified Paradigm

This paper aims to develop a unified paradigm that models one's learning...
