Self-play Reinforcement Learning for Video Transmission

05/26/2020
by   Tianchi Huang, et al.

Video transmission services adopt adaptive algorithms to meet users' demands. Existing techniques are often optimized and evaluated with a function that linearly combines several weighted metrics. However, we observe that such a function fails to describe the actual requirement accurately, so methods built on it may end up violating the original needs. To address this concern, we propose Zwei, a self-play reinforcement learning algorithm for video transmission tasks. Zwei updates its policy by using the actual requirement directly. Technically, Zwei samples a number of trajectories from the same starting point and immediately estimates the win rate with respect to the competition outcome, where the competition outcome indicates which trajectory is closer to the assigned requirement. Zwei then optimizes the strategy by maximizing the win rate. To build Zwei, we develop simulation environments, design suitable neural network models, and devise training methods for handling different requirements in various video transmission scenarios. Trace-driven analysis over two representative tasks demonstrates that Zwei faithfully optimizes itself according to the assigned requirement, outperforming state-of-the-art methods under all considered scenarios.
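The win-rate estimate described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the outcome fields (`rebuffer`, `quality`), the comparison rule in `compare`, and the function name `win_rates` are all hypothetical, chosen only to show how trajectories sampled from the same starting point might be ranked by pairwise competition against a requirement rather than by a weighted sum of metrics.

```python
from itertools import combinations


def compare(a, b):
    """Return 1 if outcome a wins, -1 if b wins, 0 on a tie.

    Hypothetical requirement (an assumption, not taken from the paper):
    less rebuffering wins first; ties are broken by higher quality.
    """
    if a["rebuffer"] != b["rebuffer"]:
        return 1 if a["rebuffer"] < b["rebuffer"] else -1
    if a["quality"] != b["quality"]:
        return 1 if a["quality"] > b["quality"] else -1
    return 0


def win_rates(outcomes):
    """Estimate each trajectory's win rate from all pairwise competitions."""
    n = len(outcomes)
    if n < 2:
        raise ValueError("need at least two trajectories to compete")
    wins = [0.0] * n
    for i, j in combinations(range(n), 2):
        result = compare(outcomes[i], outcomes[j])
        if result > 0:
            wins[i] += 1.0
        elif result < 0:
            wins[j] += 1.0
        else:
            # A tie counts as half a win for each side.
            wins[i] += 0.5
            wins[j] += 0.5
    # Each trajectory plays n - 1 opponents.
    return [w / (n - 1) for w in wins]


# Three trajectories sampled from the same starting point (made-up numbers).
sampled = [
    {"rebuffer": 0.0, "quality": 2.0},
    {"rebuffer": 0.0, "quality": 3.0},
    {"rebuffer": 1.5, "quality": 4.0},
]
rates = win_rates(sampled)
print(rates)
```

In a training loop, a per-trajectory win rate of this kind could serve as the learning signal that the policy update tries to maximize; how Zwei actually does so is described in the full text, not here.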


research
05/07/2018

QARC: Video Quality Aware Rate Control for Real-Time Video Streaming based on Deep Reinforcement Learning

Real-time video streaming is now one of the main applications in all net...
research
05/07/2018

QARC: Video Quality Aware Rate Control for Real-Time Video Streaming via Deep Reinforcement Learning

Due to the fluctuation of throughput under various network conditions, h...
research
11/15/2018

Tiyuntsong: A Self-Play Reinforcement Learning Approach for ABR Video Streaming

Existing reinforcement learning(RL)-based adaptive bitrate(ABR) approach...
research
08/06/2019

Comyco: Quality-Aware Adaptive Video Streaming via Imitation Learning

Learning-based Adaptive Bit Rate (ABR) method, aiming to learn outstandi...
research
08/28/2020

Investigating Taxi and Uber competition in New York City: Multi-agent modeling by reinforcement-learning

The taxi business has been overly regulated for many decades. Regulation...
research
06/08/2020

A Comparison of Self-Play Algorithms Under a Generalized Framework

Throughout scientific history, overarching theoretical frameworks have a...
