Team Deep Mixture of Experts for Distributed Power Control

by Matteo Zecchin, et al.

In the context of wireless networking, it was recently shown that multiple DNNs can be jointly trained to offer a desired collaborative behaviour capable of coping with a broad range of sensing uncertainties. In particular, it was established that DNNs can be used to derive policies that are robust with respect to the statistics of the noise affecting the local information (e.g., CSI in a wireless network) used by each agent (e.g., a transmitter) to make its decision. While promising, a major challenge in implementing such a method is that the information noise statistics may differ from agent to agent and, more importantly, that such statistics may not be available at training time or may evolve over time, making burdensome retraining necessary. This situation makes it desirable to devise a "universal" machine learning model, which can be trained once and for all so as to allow for decentralized cooperation in any future feedback noise environment. With this goal in mind, we propose an architecture inspired by the well-known Mixture of Experts (MoE) model, which was previously used for non-linear regression and classification tasks in various contexts, such as computer vision and speech recognition. We consider the decentralized power control problem as an example to showcase the validity of the proposed model and to compare it against other power control algorithms. We show the ability of the so-called Team-DMoE model to efficiently track time-varying statistical scenarios.
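
For readers unfamiliar with the Mixture of Experts idea mentioned in the abstract, the following is a minimal sketch of a generic MoE forward pass: a gating function assigns softmax weights to several experts, and the output is the gate-weighted sum of the expert outputs. It is not the Team-DMoE architecture from the paper. The function names, the linear gating rule, and the two toy "experts" (stand-ins for power-control rules tuned to different noise levels) are all assumptions made for illustration.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def moe_output(x, experts, gate_params):
    """Generic MoE: gate-weighted sum of expert outputs for a scalar input x.

    gate_params holds one hypothetical (weight, bias) pair per expert;
    the gate score of each expert is a linear function of x.
    """
    scores = [w * x + b for (w, b) in gate_params]
    gates = softmax(scores)
    outputs = [expert(x) for expert in experts]
    mixed = sum(g * y for g, y in zip(gates, outputs))
    return mixed, gates

# Two toy experts (assumptions, not taken from the paper):
# one follows the locally observed channel gain, one uses a fixed power level.
experts = [lambda h: min(1.0, h), lambda h: 0.5]
gate_params = [(1.0, 0.0), (-1.0, 0.0)]

power, gates = moe_output(0.8, experts, gate_params)
```

In this toy setting the gate favours the first expert when the observed gain is large, so the mixed output lies between the two expert outputs. In a trained model the gate and the experts would be neural networks learned jointly rather than fixed rules.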


Deep Reinforcement Learning Based Power control for Wireless Multicast Systems

We consider a multicast scheme recently proposed for a wireless downlink...

Decentralized Ergodic Control: Distribution-Driven Sensing and Exploration for Multi-Agent Systems

We present a decentralized ergodic control policy for time-varying area ...

Robust Beamforming for AN Aided MISO SWIPT System with Unknown Eavesdroppers and Non-linear EH Model

This work studies a beamforming design for downlink transmission of a mu...

Machine learning based iterative learning control for non-repetitive time-varying systems

The repetitive tracking task for time-varying systems (TVSs) with non-re...

Semi-Supervised Imitation Learning of Team Policies from Suboptimal Demonstrations

We present Bayesian Team Imitation Learner (BTIL), an imitation learning...

Decentralized Connectivity Control in Quadcopters: a Field Study of Communication Performance

Redundancy and parallelism make decentralized multi-robot systems appeal...

Decentralized learning for wireless communications and networking

This chapter deals with decentralized learning algorithms for in-network...
