Online Network Source Optimization with Graph-Kernel MAB

07/07/2023
by   Laura Toni, et al.
0

We propose Grab-UCB, a graph-kernel multi-arms bandit algorithm to learn online the optimal source placement in large scale networks, such that the reward obtained from a priori unknown network processes is maximized. The uncertainty calls for online learning, which suffers however from the curse of dimensionality. To achieve sample efficiency, we describe the network processes with an adaptive graph dictionary model, which typically leads to sparse spectral representations. This enables a data-efficient learning framework, whose learning rate scales with the dimension of the spectral representation model instead of the one of the network. We then propose Grab-UCB, an online sequential decision strategy that learns the parameters of the spectral representation while optimizing the action strategy. We derive the performance guarantees that depend on network parameters, which further influence the learning curve of the sequential decision strategy We introduce a computationally simplified solving method, Grab-arm-Light, an algorithm that walks along the edges of the polytope representing the objective function. Simulations results show that the proposed online learning algorithm outperforms baseline offline methods that typically separate the learning phase from the testing one. The results confirm the theoretical findings, and further highlight the gain of the proposed online learning strategy in terms of cumulative regret, sample efficiency and computational complexity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/27/2018

IRSA Transmission Optimization via Online Learning

In this work, we propose a new learning framework for optimising transmi...
research
12/29/2021

Socially-Optimal Mechanism Design for Incentivized Online Learning

Multi-arm bandit (MAB) is a classic online learning framework that studi...
research
02/26/2019

Efficient online learning with kernels for adversarial large scale problems

We are interested in a framework of online learning with kernels for low...
research
05/21/2018

Online Learning in Kernelized Markov Decision Processes

We consider online learning for minimizing regret in unknown, episodic M...
research
08/09/2014

Normalized Online Learning

We introduce online learning algorithms which are independent of feature...
research
03/06/2023

Online Learning and Optimization for Queues with Unknown Demand Curve and Service Distribution

We investigate an optimization problem in a queueing system where the se...
research
03/13/2020

Identification of AC Networks via Online Learning

The increasing integration of intermittent renewable generation in power...

Please sign up or login with your details

Forgot password? Click here to reset