R-Learning Based Admission Control for Service Federation in Multi-domain 5G Networks

03/04/2021
by   Bahador Bakhshi, et al.
0

Service federation in 5G/B5G networks enables service providers to orchestrate network services across multiple domains where admission control is a key issue. For each demand, without knowing the future ones, the admission controller either determines the domain to deploy the demand or rejects it in order to maximize the long-term average profit. In this paper, at first, under the assumption of knowing the arrival and departure rates of demands, we obtain the optimal admission control policy by formulating the problem as a Markov decision process that is solved by the policy iteration method. As a practical solution, where the rates are not known, we apply the Q-Learning and R-Learning algorithms to approximate the optimal policy. The extensive simulation results show the learning approaches outperform the greedy policy, and while the performance of Q-Learning depends on the discount factor, the optimality gap of the R-Learning algorithm is at most 3-5 configuration.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/24/2021

Multi-Provider NFV Network Service Delegation via Average Reward Reinforcement Learning

In multi-provider 5G/6G networks, service delegation enables administrat...
research
01/10/2021

Learning Augmented Index Policy for Optimal Service Placement at the Network Edge

We consider the problem of service placement at the network edge, in whi...
research
02/04/2022

Learning a Discrete Set of Optimal Allocation Rules in Queueing Systems with Unknown Service Rates

We study learning-based admission control for a classical Erlang-B block...
research
12/30/2022

A deep real options policy for sequential service region design and timing

As various city agencies and mobility operators navigate toward innovati...
research
02/26/2019

Optimal and Fast Real-time Resources Slicing with Deep Dueling Neural Networks

Effective network slicing requires an infrastructure/network provider to...
research
03/29/2020

Optimizing Coordinated Vehicle Platooning: An Analytical Approach Based on Stochastic Dynamic Programming

Platooning connected and autonomous vehicles (CAVs) can improve traffic ...
research
02/27/2023

Dynamic Resource Allocation for Metaverse Applications with Deep Reinforcement Learning

This work proposes a novel framework to dynamically and effectively mana...

Please sign up or login with your details

Forgot password? Click here to reset