Actor-Critic Scheduling for Path-Aware Air-to-Ground Multipath Multimedia Delivery

04/28/2022
by   Achilles Machumilane, et al.
1

Reinforcement Learning (RL) has recently found wide applications in network traffic management and control because some of its variants do not require prior knowledge of network models. In this paper, we present a novel scheduler for real-time multimedia delivery in multipath systems based on an Actor-Critic (AC) RL algorithm. We focus on a challenging scenario of real-time video streaming from an Unmanned Aerial Vehicle (UAV) using multiple wireless paths. The scheduler acting as an RL agent learns in real-time the optimal policy for path selection, path rate allocation and redundancy estimation for flow protection. The scheduler, implemented as a module of the GStreamer framework, can be used in real or simulated settings. The simulation results show that our scheduler can target a very low loss rate at the receiver by dynamically adapting in real-time the scheduling policy to the path conditions without performing training or relying on prior knowledge of network channel models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/05/2020

Using Soft Actor-Critic for Low-Level UAV Control

Unmanned Aerial Vehicles (UAVs), or drones, have recently been used in s...
research
11/11/2019

Real-Time Reinforcement Learning

Markov Decision Processes (MDPs), the mathematical framework underlying ...
research
06/04/2017

Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known Dynamics

In many robotic applications, some aspects of the system dynamics can be...
research
12/01/2021

Homotopy Based Reinforcement Learning with Maximum Entropy for Autonomous Air Combat

The Intelligent decision of the unmanned combat aerial vehicle (UCAV) ha...
research
06/17/2019

PACMAN: A Planner-Actor-Critic Architecture for Human-Centered Planning and Learning

Conventional reinforcement learning (RL) allows an agent to learn polici...
research
11/27/2018

Target Driven Visual Navigation with Hybrid Asynchronous Universal Successor Representations

Being able to navigate to a target with minimal supervision and prior kn...
research
03/28/2022

Network Performance Estimator with Applications to Route Selection for IoT Multimedia Applications

Estimating the performance of multimedia traffic is important in numerou...

Please sign up or login with your details

Forgot password? Click here to reset