Karma: Adaptive Video Streaming via Causal Sequence Modeling

08/20/2023
by   Bowei Xu, et al.
0

Optimal adaptive bitrate (ABR) decision depends on a comprehensive characterization of state transitions that involve interrelated modalities over time including environmental observations, returns, and actions. However, state-of-the-art learning-based ABR algorithms solely rely on past observations to decide the next action. This paradigm tends to cause a chain of deviations from optimal action when encountering unfamiliar observations, which consequently undermines the model generalization. This paper presents Karma, an ABR algorithm that utilizes causal sequence modeling to improve generalization by comprehending the interrelated causality among past observations, returns, and actions and timely refining action when deviation occurs. Unlike direct observation-to-action mapping, Karma recurrently maintains a multi-dimensional time series of observations, returns, and actions as input and employs causal sequence modeling via a decision transformer to determine the next action. In the input sequence, Karma uses the maximum cumulative future quality of experience (QoE) (a.k.a, QoE-to-go) as an extended return signal, which is periodically estimated based on current network conditions and playback status. We evaluate Karma through trace-driven simulations and real-world field tests, demonstrating superior performance compared to existing state-of-the-art ABR algorithms, with an average QoE improvement ranging from 10.8 diverse network conditions. Furthermore, Karma exhibits strong generalization capabilities, showing leading performance under unseen networks in both simulations and real-world tests.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/10/2019

Forecasting Future Sequence of Actions to Complete an Activity

Future human action forecasting from partial observations of activities ...
research
05/21/2018

Imitating Latent Policies from Observation

We describe a novel approach to imitation learning that infers latent po...
research
10/12/2021

StARformer: Transformer with State-Action-Reward Representations

Reinforcement Learning (RL) can be considered as a sequence modeling tas...
research
05/26/2023

Emergent Agentic Transformer from Chain of Hindsight Experience

Large transformer models powered by diverse data and model scale have do...
research
12/17/2022

Inductive Attention for Video Action Anticipation

Anticipating future actions based on video observations is an important ...
research
10/11/2021

Towards Streaming Egocentric Action Anticipation

Egocentric action anticipation is the task of predicting the future acti...
research
05/05/2022

Identifying Cause-and-Effect Relationships of Manufacturing Errors using Sequence-to-Sequence Learning

In car-body production the pre-formed sheet metal parts of the body are ...

Please sign up or login with your details

Forgot password? Click here to reset