Tiyuntsong: A Self-Play Reinforcement Learning Approach for ABR Video Streaming

11/15/2018
by Tianchi Huang, et al.

Existing reinforcement learning (RL)-based adaptive bitrate (ABR) approaches outperform earlier methods based on fixed control rules by improving the Quality of Experience (QoE) score. However, the QoE metric provides little clear guidance for optimization, which can lead to unexpected strategies. In this paper, we propose Tiyuntsong, a self-play reinforcement learning approach with a generative adversarial network (GAN)-based method for ABR video streaming. Tiyuntsong learns strategies automatically by training two agents that compete against each other. The competition results are evaluated with a rule rather than a numerical QoE score, and the rule has a clear optimization goal. We also propose a GAN Enhancement Module that extracts hidden features from past status, preserving information without being limited by sequence length. Using testbed experiments, we show that using the GAN significantly improves Tiyuntsong's performance. Comparing ABR algorithms, we observe that Tiyuntsong also outperforms existing ABR algorithms on the underlying metrics.
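To make the idea of rule-based competition concrete, here is a minimal Python sketch of how two agents' playback sessions might be compared by a rule instead of a scalar QoE score. The abstract does not state the actual rule, so everything here is an assumption made for illustration: the `SessionStats` fields, the `judge` and `self_play_rewards` names, and the rule itself (less rebuffering wins, ties broken by higher mean bitrate).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SessionStats:
    """Summary of one playback session (hypothetical fields)."""
    rebuffer_s: float         # total stall time, in seconds
    mean_bitrate_kbps: float  # average selected bitrate, in kbps


def judge(a: SessionStats, b: SessionStats) -> int:
    """Return 1 if `a` wins, -1 if `b` wins, and 0 for a draw.

    Assumed rule, not taken from the paper: the session with less
    rebuffering wins; if rebuffering is equal, the higher mean
    bitrate wins.
    """
    if a.rebuffer_s != b.rebuffer_s:
        return 1 if a.rebuffer_s < b.rebuffer_s else -1
    if a.mean_bitrate_kbps != b.mean_bitrate_kbps:
        return 1 if a.mean_bitrate_kbps > b.mean_bitrate_kbps else -1
    return 0


def self_play_rewards(a: SessionStats, b: SessionStats) -> tuple[float, float]:
    """Turn the win/loss outcome into zero-sum rewards for the two agents."""
    outcome = judge(a, b)
    return float(outcome), float(-outcome)
```

For example, under this assumed rule `judge(SessionStats(0.0, 1200.0), SessionStats(1.5, 2500.0))` returns `1`: the first session wins despite its lower bitrate because it never stalls. The point of the sketch is only that a win/loss signal gives each agent an unambiguous training target, whereas a weighted QoE sum depends on how its coefficients are chosen.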

research
03/22/2022

Network state Estimation using Raw Video Analysis: vQoS-GAN based non-intrusive Deep Learning Approach

Content based providers transmits real time complex signal such as video...
research
04/10/2023

Improving ABR Performance for Short Video Streaming Using Multi-Agent Reinforcement Learning with Expert Guidance

In the realm of short video streaming, popular adaptive bitrate (ABR) al...
research
05/23/2017

Enhanced Experience Replay Generation for Efficient Reinforcement Learning

Applying deep reinforcement learning (RL) on real systems suffers from s...
research
05/26/2020

Self-play Reinforcement Learning for Video Transmission

Video transmission services adopt adaptive algorithms to ensure users' d...
research
04/10/2023

Deep Reinforcement Learning with Importance Weighted A3C for QoE enhancement in Video Delivery Services

Adaptive bitrate (ABR) algorithms are used to adapt the video bitrate ba...
research
08/28/2020

Real-world Video Adaptation with Reinforcement Learning

Client-side video players employ adaptive bitrate (ABR) algorithms to op...
research
01/04/2019

QFlow: A Reinforcement Learning Approach to High QoE Video Streaming over Wireless Networks

Wireless Internet access has brought legions of heterogeneous applicatio...
