Multi-Player Multi-Armed Bandit Based Resource Allocation for D2D Communications

12/12/2018
by   Anushree Neogi, et al.
0

Device-to-device (D2D) communications is expected to play a significant role in increasing the system capacity of the fifth generation (5G) wireless networks. To accomplish this, efficient power and resource allocation algorithms need to be devised for the D2D users. Since the D2D users are treated as secondary users, their interference to the cellular users (CUs) should not hamper the CU communications. Most of the prior works on D2D resource allocation assume full channel state information (CSI) at the base station (BS). However, the required channel gains for the D2D pairs may not be known. To acquire these in a fast fading channel requires extra power and control overhead. In this paper, we assume partial CSI and formulate the D2D power and resource allocation problem as a multi-armed bandit problem. We propose a power allocation scheme for the D2D users in which the BS allocates power to the D2D users if a certain signal-to-interference-plus-noise ratio (SINR) is maintained for the CUs. In a single player environment a D2D user selects a CU in every time slot by employing UCB1 algorithm. Since this resource allocation problem can also be considered as an adversarial bandit problem we have applied the exponential-weight algorithm for exploration and exploitation (Exp3) to solve it. In a multiple player environment, we extend UCB1 and Exp3 to multiple D2D users. We also propose two algorithms that are based on distributed learning algorithm with fairness (DLF) and kth-UCB1 algorithms in which the D2D users are ranked. Our simulation results show that our proposed algorithms are fair and achieve good performance.

READ FULL TEXT
research
11/06/2017

Resource Allocation for D2D Communications with Partial Channel State Information

Enhancement of system capacity is one of the objectives of the fifth gen...
research
05/09/2019

Joint power and resource allocation of D2D communication with low-resolution ADC

This paper considers the joint power control and resource allocation for...
research
02/26/2020

Robust Underlay Device-to-Device Communications on Multiple Channels

Most recent works in device-to-device (D2D) underlay communications focu...
research
05/11/2021

Resource Allocation for Smooth Streaming: Non-convexity and Bandits

User dissatisfaction due to buffering pauses during streaming is a signi...
research
06/06/2018

When Distributed outperforms Centralized Scheduling in D2D-Enabled Cellular Networks

Device-to-device (D2D) communications is a promising technique for impro...
research
03/25/2023

Hierarchical Multi-Agent Multi-Armed Bandit for Resource Allocation in Multi-LEO Satellite Constellation Networks

Low Earth orbit (LEO) satellite constellation is capable of providing gl...
research
09/14/2019

IEEE 802.15.4.e TSCH-Based Scheduling for Throughput Optimization: A Combinatorial Multi-Armed Bandit Approach

In TSCH, which is a MAC mechanism set of the IEEE 802.15.4e amendment, c...

Please sign up or login with your details

Forgot password? Click here to reset