Wireless Optimisation via Convex Bandits: Unlicensed LTE/WiFi Coexistence

02/05/2018
by   Cristina Cano, et al.
0

Bandit Convex Optimisation (BCO) is a powerful framework for sequential decision-making in non-stationary and partially observable environments. In a BCO problem, a decision-maker sequentially picks actions to minimize the cumulative cost associated with these decisions, all while receiving partial feedback about the state of the environment. This formulation is a very natural fit for wireless-network optimisation problems and has great application potential since: i) instead of assuming full observability of the network state, it only requires the metric to optimise as input, and ii) it provides strong performance guarantees while making only minimal assumptions about the network dynamics. Despite these advantages, BCO has not yet been explored in the context of wireless-network optimisation. In this paper, we make the first steps to demonstrate the potential of BCO techniques by formulating an unlicensed LTE/WiFi fair coexistence use case in the framework, and providing experimental results in a simulated environment. On the algorithmic front, we propose a simple and natural sequential multi-point BCO algorithm amenable to wireless networking optimisation, and provide its theoretical analysis. We expect the contributions of this paper to pave the way to further research on the application of online convex methods in the bandit setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/29/2019

Bandit Convex Optimization in Non-stationary Environments

Bandit Convex Optimization (BCO) is a fundamental framework for modeling...
research
09/05/2020

Unifying Clustered and Non-stationary Bandits

Non-stationary bandits and online clustering of bandits lift the restric...
research
08/21/2023

Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach

Online decision making plays a crucial role in numerous real-world appli...
research
03/08/2021

Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games

Tree-form sequential decision making (TFSDM) extends classical one-shot ...
research
01/13/2022

Non-Stationary Representation Learning in Sequential Linear Bandits

In this paper, we study representation learning for multi-task decision-...
research
06/21/2021

Smooth Sequential Optimisation with Delayed Feedback

Stochastic delays in feedback lead to unstable sequential learning using...
research
02/14/2020

On State Variables, Bandit Problems and POMDPs

State variables are easily the most subtle dimension of sequential decis...

Please sign up or login with your details

Forgot password? Click here to reset