Graph Signal Sampling via Reinforcement Learning

05/15/2018
by   Oleksii Abramenko, et al.
2

We formulate the problem of sampling and recovering clustered graph signal as a multi-armed bandit (MAB) problem. This formulation lends naturally to learning sampling strategies using the well-known gradient MAB algorithm. In particular, the sampling strategy is represented as a probability distribution over the individual arms of the MAB and optimized using gradient ascent. Some illustrative numerical experiments indicate that the sampling strategies based on the gradient MAB algorithm outperform existing sampling methods.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset