Multi-Armed Bandit Learning in IoT Networks: Learning helps even in non-stationary settings

07/02/2018
by   Rémi Bonnefoi, et al.
0

Setting up the future Internet of Things (IoT) networks will require to support more and more communicating devices. We prove that intelligent devices in unlicensed bands can use Multi-Armed Bandit (MAB) learning algorithms to improve resource exploitation. We evaluate the performance of two classical MAB learning algorithms, UCB1 and Thompson Sampling, to handle the decentralized decision-making of Spectrum Access, applied to IoT networks; as well as learning performance with a growing number of intelligent end-devices. We show that using learning algorithms does help to fit more devices in such networks, even when all end-devices are intelligent and are dynamically changing channel. In the studied scenario, stochastic MAB learning provides a up to 16 term of successful transmission probabilities, and has near optimal performance even in non-stationary and non-i.i.d. settings with a majority of intelligent devices.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/05/2019

GNU Radio Implementation of MALIN: "Multi-Armed bandits Learning for Internet-of-things Networks"

We implement an IoT network the following way: one gateway, one or sever...
research
02/27/2019

Upper-Confidence Bound for Channel Selection in LPWA Networks with Retransmissions

In this paper, we propose and evaluate different learning strategies bas...
research
03/30/2020

Decentralized Learning for Channel Allocation in IoT Networks over Unlicensed Bandwidth as a Contextual Multi-player Multi-armed Bandit Game

We study a decentralized channel allocation problem in an ad-hoc Interne...
research
05/08/2022

FOLPETTI: A Novel Multi-Armed Bandit Smart Attack for Wireless Networks

Channel hopping provides a defense mechanism against jamming attacks in ...
research
08/02/2023

Maximizing Success Rate of Payment Routing using Non-stationary Bandits

This paper discusses the system architecture design and deployment of no...
research
12/10/2021

SmartCon: Deep Probabilistic Learning Based Intelligent Link-Configuration in Narrowband-IoT Towards 5G and B5G

To enhance the coverage and transmission reliability, repetitions adopte...
research
01/12/2020

Collaborative Multi-Agent Multi-Armed Bandit Learning for Small-Cell Caching

This paper investigates learning-based caching in small-cell networks (S...

Please sign up or login with your details

Forgot password? Click here to reset