Decentralized Learning for Channel Allocation in IoT Networks over Unlicensed Bandwidth as a Contextual Multi-player Multi-armed Bandit Game

03/30/2020
by   Wenbo Wang, et al.
0

We study a decentralized channel allocation problem in an ad-hoc Internet of Things (IoT) network underlaying on a spectrum licensed to an existing wireless network. In the considered IoT network, the impoverished computation capability and the limited antenna number on the IoT devices make them difficult to acquire the Channel State Information (CSI) for the multi-channels over the shared spectrum. In addition, in practice, the unknown patterns of the licensed users' transmission activities and the time-varying CSI due to fast fading or mobility of the IoT devices can also cause stochastic changes in the channel quality. Therefore, decentralized IoT links are expected to learn their channel statistics online based on the partial observations, while acquiring no information about the channels that they are not operating on. Meanwhile, they also have to reach an efficient, collision-free solution of channel allocation on the basis of limited coordination or message exchange. Our study maps this problem into a contextual multi-player, multi-arm bandit game, for which we propose a purely decentralized, three-stage policy learning algorithm through trial-and-error. Our theoretical analysis shows that the proposed learning algorithm guarantees the IoT devices to jointly converge to the social-optimal channel allocation with a sub-linear (i.e., polylogarithmic) regret with respect to the operational time. Simulation results demonstrate that the proposed algorithm strikes a good balance between efficient channel allocation and network scalability when compared with the other state-of-the-art distributed multi-armed bandit algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/17/2019

Distributed Learning for Channel Allocation Over a Shared Spectrum

Channel allocation is the task of assigning channels to users such that ...
research
09/14/2019

IEEE 802.15.4.e TSCH-Based Scheduling for Throughput Optimization: A Combinatorial Multi-Armed Bandit Approach

In TSCH, which is a MAC mechanism set of the IEEE 802.15.4e amendment, c...
research
01/08/2021

Hermes: Decentralized Dynamic Spectrum Access System for Massive Devices Deployment in 5G

With the incoming 5G network, the ubiquitous Internet of Things (IoT) de...
research
02/27/2019

Upper-Confidence Bound for Channel Selection in LPWA Networks with Retransmissions

In this paper, we propose and evaluate different learning strategies bas...
research
07/02/2018

Multi-Armed Bandit Learning in IoT Networks: Learning helps even in non-stationary settings

Setting up the future Internet of Things (IoT) networks will require to ...
research
09/21/2022

Consensus-based Fast and Energy-Efficient Multi-Robot Task Allocation

In a multi-robot system, the appropriate allocation of the tasks to the ...
research
03/06/2020

Distributed Learning in Ad-Hoc Networks: A Multi-player Multi-armed Bandit Framework

Next-generation networks are expected to be ultra-dense with a very high...

Please sign up or login with your details

Forgot password? Click here to reset