Pattern Transfer Learning for Reinforcement Learning in Order Dispatching

05/27/2021
by Runzhe Wan, et al.

Order dispatch is one of the central problems for ride-sharing platforms. Recently, value-based reinforcement learning algorithms have shown promising performance on this problem. In real-world applications, however, the non-stationarity of the demand-supply system makes it difficult to reuse data generated in different time periods when learning the value function. Motivated by the observation that the relative relationship between the values of some states is largely stable across environments, we propose a pattern transfer learning framework for value-based reinforcement learning in the order dispatch problem. Our method captures these value patterns efficiently by incorporating a concordance penalty. Experiments support the superior performance of the proposed method.
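To make the idea of a concordance penalty concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the penalty defined in the paper: the abstract does not give the formula, so the pairwise hinge form, the function names `discordance_penalty` and `penalized_loss`, and the `weight` parameter are all hypothetical. The sketch penalizes pairs of states whose value ordering under the new estimate disagrees with the ordering under an estimate learned from older data.

```python
from itertools import combinations


def discordance_penalty(source_values, target_values):
    """Pairwise hinge penalty on ordering disagreements (illustrative only).

    source_values: value estimates for a set of states from older data.
    target_values: value estimates for the same states in the current period.
    The pairwise hinge form is an assumption, not the paper's definition.
    """
    if len(source_values) != len(target_values):
        raise ValueError("both value lists must cover the same states")
    penalty = 0.0
    for i, j in combinations(range(len(source_values)), 2):
        source_gap = source_values[i] - source_values[j]
        target_gap = target_values[i] - target_values[j]
        # A pair is concordant when both gaps share a sign, so the product
        # is positive and contributes nothing; discordant pairs are penalized.
        penalty += max(0.0, -source_gap * target_gap)
    return penalty


def penalized_loss(td_errors, source_values, target_values, weight):
    """Mean squared TD error plus a weighted discordance penalty (hypothetical)."""
    td_loss = sum(e * e for e in td_errors) / len(td_errors)
    return td_loss + weight * discordance_penalty(source_values, target_values)
```

Under this assumed form, the penalty depends only on how the states are ordered relative to each other and on the size of the gaps, not on the absolute level of the values, which is why such a term could carry over a stable value pattern even when the overall scale of values shifts between time periods.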

research
09/14/2017

Shared Learning : Enhancing Reinforcement in Q-Ensembles

Deep Reinforcement Learning has been able to achieve amazing successes i...
research
04/11/2018

Universal Successor Representations for Transfer Reinforcement Learning

The objective of transfer reinforcement learning is to generalize from a...
research
04/03/2018

StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning

Real-time strategy games have been an important field of game artificial...
research
12/31/2019

The Gambler's Problem and Beyond

We analyze the Gambler's problem, a simple reinforcement learning proble...
research
02/23/2022

Learning Relative Return Policies With Upside-Down Reinforcement Learning

Lately, there has been a resurgence of interest in using supervised lear...
research
02/10/2020

Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning

In many real-world settings, a team of cooperative agents must learn to ...
research
01/20/2022

Two-Sample Testing in Reinforcement Learning

Value-based reinforcement-learning algorithms have shown strong performa...
