Multi-Armed Bandits for Decentralized AP selection in Enterprise WLANs

01/02/2020
by Marc Carrascosa, et al.

WiFi densification creates multiple overlapping coverage areas, which allow user stations (STAs) to choose among different Access Points (APs). The standard WiFi association method makes each STA select the AP with the strongest signal, which in many cases leaves some APs underutilized while overcrowding others. To mitigate this, Reinforcement Learning techniques such as Multi-Armed Bandits can be used to dynamically learn the optimal mapping between APs and STAs, and to redistribute the STAs among the available APs accordingly. This is an especially challenging problem because the network response observed by a given STA depends on the behavior of the others, and so it is very difficult to predict without a global view of the network. In this paper, we focus on solving this problem in a decentralized way, where STAs independently explore the different APs inside their coverage range and select the one that best satisfies their needs. To do so, we propose a novel approach called Opportunistic epsilon-greedy with Stickiness, which halts exploration when a suitable AP is found and resumes it only after several unsatisfactory association rounds. With this approach, we significantly reduce the network response dynamics, which helps the STAs find a solution faster and makes more efficient use of the network resources. We investigate how the characteristics of the scenario (position of the APs and STAs, traffic loads, and channel allocation strategies) affect the learning process and the achievable performance. We also show that not all the STAs have to implement the proposed solution to improve their performance. Finally, we study the case where stations arrive progressively to the system, showing that the considered approach is also suitable in such a non-stationary set-up.
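
The stop-and-resume behavior described above can be illustrated with a minimal sketch of a per-STA agent. This is not the authors' implementation: the class name `StickyEpsilonGreedy`, the parameters `epsilon` and `stickiness`, the incremental-mean reward estimate, and the boolean `satisfied` signal are all assumptions made for illustration, since the abstract does not specify how rewards or satisfaction are defined.

```python
import random


class StickyEpsilonGreedy:
    """Hypothetical per-STA agent: epsilon-greedy that stops exploring
    once satisfied and resumes after several unsatisfactory rounds."""

    def __init__(self, n_aps, epsilon=0.1, stickiness=3, rng=None):
        self.n_aps = n_aps
        self.epsilon = epsilon          # assumed exploration probability
        self.stickiness = stickiness    # assumed number of bad rounds tolerated
        self.rng = rng or random.Random()
        self.counts = [0] * n_aps       # times each AP was evaluated
        self.means = [0.0] * n_aps      # running mean reward per AP
        self.current = self.rng.randrange(n_aps)
        self.stuck = False              # True while exploration is halted
        self.bad_rounds = 0             # consecutive unsatisfactory rounds while stuck

    def select_ap(self):
        """Return the AP index to associate with in the next round."""
        if self.stuck:
            return self.current         # stay on the AP that satisfied us
        if self.rng.random() < self.epsilon:
            self.current = self.rng.randrange(self.n_aps)   # explore
        else:
            self.current = max(range(self.n_aps),
                               key=lambda a: self.means[a])  # exploit
        return self.current

    def update(self, reward, satisfied):
        """Record the round's outcome for the currently selected AP."""
        a = self.current
        self.counts[a] += 1
        self.means[a] += (reward - self.means[a]) / self.counts[a]
        if satisfied:
            self.stuck = True           # suitable AP found: halt exploration
            self.bad_rounds = 0
        elif self.stuck:
            self.bad_rounds += 1
            if self.bad_rounds >= self.stickiness:
                self.stuck = False      # too many bad rounds: resume exploring
                self.bad_rounds = 0
```

In each association round, an STA would call `select_ap()`, associate with the returned AP, measure its performance, and then call `update()` with the observed reward and whether its needs were met. Because every STA runs its own agent, no global view of the network is required.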
