Collaborative Spatial Reuse in Wireless Networks via Selfish Multi-Armed Bandits

10/31/2017
by Francesc Wilhelmi, et al.

Next-generation wireless deployments are dense and uncoordinated, which often leads to inefficient use of resources and poor performance. To address this, we envision completely decentralized mechanisms that enhance Spatial Reuse (SR). In particular, we focus on Reinforcement Learning (RL), and more specifically on Multi-Armed Bandits (MABs), to allow networks to adjust both their transmission power and channel based on their experienced throughput. In this work, we study the exploration-exploitation trade-off by means of the ε-greedy, EXP3, UCB and Thompson sampling action-selection strategies. Our results show that optimal proportional fairness can be achieved even if no information about neighboring networks is available to the learners and the wireless networks (WNs) operate selfishly. However, the throughput experienced by individual networks varies strongly over time, especially for ε-greedy and EXP3. We attribute this variability to the adversarial nature of our setup, in which the most-played actions provide intermittently good or poor performance depending on the neighbors' decisions. We also show that this variability is reduced by UCB and Thompson sampling, which are parameter-free policies that perform exploration according to the reward distribution of each action.
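
To make the action-selection idea concrete, the following is a minimal sketch of how a single network might choose a (channel, transmission power) pair with ε-greedy or UCB. It is an illustration under stated assumptions, not the authors' implementation: the channel and power values, the `BanditAgent` class, and the normalization of throughput to a reward in [0, 1] are all hypothetical, and the textbook UCB1 index is used here because the abstract does not specify the exact variant.

```python
import math
import random
from itertools import product

# Hypothetical action space: each (channel, tx_power_dbm) pair is one arm.
CHANNELS = [1, 6, 11]
TX_POWERS_DBM = [5, 10, 15, 20]
ACTIONS = list(product(CHANNELS, TX_POWERS_DBM))


class BanditAgent:
    """Tracks per-arm play counts and running mean rewards.

    Rewards are assumed to be throughput normalized to [0, 1].
    """

    def __init__(self, n_arms, rng=None):
        self.counts = [0] * n_arms
        self.means = [0.0] * n_arms
        self.rng = rng or random.Random()

    def update(self, arm, reward):
        # Incremental update of the running mean reward of the played arm.
        self.counts[arm] += 1
        self.means[arm] += (reward - self.means[arm]) / self.counts[arm]

    def select_epsilon_greedy(self, epsilon):
        # Explore uniformly with probability epsilon, otherwise exploit.
        if self.rng.random() < epsilon:
            return self.rng.randrange(len(self.counts))
        return max(range(len(self.counts)), key=lambda a: self.means[a])

    def select_ucb1(self):
        # Play every arm once before applying the UCB1 index.
        for arm, n in enumerate(self.counts):
            if n == 0:
                return arm
        t = sum(self.counts)
        return max(
            range(len(self.counts)),
            key=lambda a: self.means[a]
            + math.sqrt(2 * math.log(t) / self.counts[a]),
        )
```

In a decentralized setting, each network would run its own agent, apply `ACTIONS[arm]`, measure its throughput, and call `update`. Because the reward of an arm depends on what neighboring networks are playing at the same time, the same arm can return very different rewards across rounds, which is the kind of variability the abstract describes.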


research
05/28/2018

Potential and Pitfalls of Multi-Armed Bandits for Decentralized Spatial Reuse in WLANs

Spatial Reuse (SR) has recently gained attention for performance maximiz...
research
11/28/2022

Cooperate or not Cooperate: Transfer Learning with Multi-Armed Bandit for Spatial Reuse in Wi-Fi

The exponential increase of wireless devices with highly demanding servi...
research
06/05/2020

Concurrent Decentralized Channel Allocation and Access Point Selection using Multi-Armed Bandits in multi BSS WLANs

Enterprise Wireless Local Area Networks (WLANs) consist of multiple Acce...
research
03/01/2019

Decentralized AP selection using Multi-Armed Bandits: Opportunistic ε-Greedy with Stickiness

WiFi densification leads to the existence of multiple overlapping covera...
research
01/14/2022

Application of Multi-Armed Bandits to Model-assisted designs for Dose-Finding Clinical Trials

We consider applying multi-armed bandits to model-assisted designs for d...
research
01/02/2020

Multi-Armed Bandits for Decentralized AP selection in Enterprise WLANs

WiFi densification leads to the existence of multiple overlapping covera...
research
03/30/2022

INSPIRE: Distributed Bayesian Optimization for ImproviNg SPatIal REuse in Dense WLANs

WLANs, which have overtaken wired networks to become the primary means o...