Sim-To-Real Optimization Of Complex Real World Mobile Network with Imperfect Information via Deep Reinforcement Learning from Self-play

02/18/2018
by   Yongxi Tan, et al.
0

Mobile network that millions of people use every day is one of the most complex systems in real world. Optimization of mobile network to meet exploding customer demand and reduce CAPEX/OPEX poses greater challenges than in prior works. Learning to solve complex problems in real world to benefit everyone and make the world better has long been ultimate goal of AI. However, it still remains an unsolved problem for deep reinforcement learning (DRL), given imperfect information in real world, huge state/action space, lots of data needed for training, associated time/cost, multi-agent interactions, potential negative impact to real world, etc. To bridge this reality gap, we proposed a DRL framework to direct transfer optimal policy learned from multi-tasks in source domain to unseen similar tasks in target domain without any further training in both domains. First, we distilled temporal-spatial relationships between cells and mobile users to scalable 3D image-like tensor to best characterize partially observed mobile network. Second, inspired by AlphaGo, we used a novel self-play mechanism to empower DRL agent to gradually improve its intelligence by competing for best record on multiple tasks. Third, a decentralized DRL method is proposed to coordinate multi-agents to compete and cooperate as a team to maximize global reward and minimize potential negative impact. Using 7693 unseen test tasks over 160 unseen simulated mobile networks and 6 field trials over 4 commercial mobile networks in real world, we demonstrated the capability of our approach to direct transfer the learning from one simulator to another simulator, and from simulation to real world. This is the first time that a DRL agent successfully transfers its learning directly from simulation to very complex real world problems with incomplete and imperfect information, huge state/action space and multi-agent interactions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/23/2020

Robust Reinforcement Learning-based Autonomous Driving Agent for Simulation and Real World

Deep Reinforcement Learning (DRL) has been successfully used to solve di...
research
04/14/2023

A Platform-Agnostic Deep Reinforcement Learning Framework for Effective Sim2Real Transfer in Autonomous Driving

Deep Reinforcement Learning (DRL) has shown remarkable success in solvin...
research
03/04/2022

Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement Learning

Deep reinforcement learning (DRL) is a promising approach to solve compl...
research
01/09/2023

Network Slicing via Transfer Learning aided Distributed Deep Reinforcement Learning

Deep reinforcement learning (DRL) has been increasingly employed to hand...
research
04/29/2021

Hypernetwork Dismantling via Deep Reinforcement Learning

Network dismantling aims to degrade the connectivity of a network by rem...
research
05/07/2023

Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity

Deep Reinforcement Learning (DRL) has exhibited efficacy in resolving th...
research
05/06/2021

Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep Reinforcement Learning

Deep Reinforcement Learning (DRL) has shown outstanding performance on i...

Please sign up or login with your details

Forgot password? Click here to reset