Rebalancing Dockless Bike Sharing Systems

02/13/2018
by   Ling Pan, et al.
0

Bike sharing provides an environment-friendly way for traveling and is booming all over the world. Yet, due to the high similarity of user travel patterns, the bike imbalance problem constantly occurs, especially for dockless bike sharing systems, causing significant impact on service quality and company revenue. Thus, it has become a critical task for bike sharing systems to resolve such imbalance efficiently. We model this problem as a Markov decision process (MDP), which takes both temporal and spatial features into consideration. We propose a novel deep reinforcement learning algorithm called Loss-Reduced Reinforcement Pricing (LRP), which builds upon the deterministic policy gradient algorithm. Different from existing methods that often ignore spatial information and rely heavily on accurate prediction, LRP is embedded with a novel network architecture to incorporate the dependence of neighboring regions, for reducing the training loss in Q-function learning. We conduct extensive experiments to evaluate the performance of the LRP algorithm, based on trajectory data from Mobike, a major Chinese dockless bike sharing company. Results show that LRP performs close to the 24-timeslot look-ahead optimization, and outperforms state-of-the-art methods in both service level and bike distribution. It also transfers well when applied to unseen areas, and can even make additional profit with the given budget. We further propose the first hybrid rebalancing system, which take advantages of both the truck-based and user-based approaches, and outperforms each individual approach.

READ FULL TEXT
research
03/29/2017

Inverse Risk-Sensitive Reinforcement Learning

We address the problem of inverse reinforcement learning in Markov decis...
research
12/17/2021

D3PG: Dirichlet DDGP for Task Partitioning and Offloading with Constrained Hybrid Action Space in Mobile Edge Computing

Mobile Edge Computing (MEC) has been regarded as a promising paradigm to...
research
07/10/2018

Generalized deterministic policy gradient algorithms

We study a setting of reinforcement learning, where the state transition...
research
11/18/2019

Mechanism Design with Predicted Task Revenue for Bike Sharing Systems

Bike sharing systems have been widely deployed around the world in recen...
research
06/08/2021

A Deep Value-network Based Approach for Multi-Driver Order Dispatching

Recent works on ride-sharing order dispatching have highlighted the impo...
research
11/26/2018

A Policy Gradient Method with Variance Reduction for Uplift Modeling

Uplift modeling aims to directly model the incremental impact of a treat...

Please sign up or login with your details

Forgot password? Click here to reset