Randomized Greedy Learning for Non-monotone Stochastic Submodular Maximization Under Full-bandit Feedback

02/02/2023 ∙ by Fares Fourati, et al.

We investigate the problem of unconstrained combinatorial multi-armed bandits with full-bandit feedback and stochastic rewards for submodular maximization. Previous works investigate the same problem assuming a submodular and monotone reward function. In this work, we study a more general problem, in which the reward function is not necessarily monotone and submodularity is assumed only in expectation. We propose the Randomized Greedy Learning (RGL) algorithm and theoretically prove that it achieves a 1/2-regret upper bound of Õ(n T^{2/3}) for horizon T and number of arms n. We also show in experiments that RGL empirically outperforms other full-bandit variants in submodular and non-submodular settings.
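The abstract does not spell out how RGL works, so the following is only a minimal sketch of what an explore-then-commit randomized greedy learner could look like under full-bandit feedback. It assumes the classical randomized double-greedy scheme for unconstrained non-monotone submodular maximization as the offline base (a natural guess given the 1/2 factor, not something the abstract confirms). The names `play`, `samples_per_set`, and `randomized_greedy_learning` are hypothetical, and the paper's actual sampling schedule and parameter choices may differ.

```python
import random
from typing import Callable, FrozenSet, Optional


def randomized_greedy_learning(
    n: int,
    play: Callable[[FrozenSet[int]], float],
    horizon: int,
    samples_per_set: int,
    rng: Optional[random.Random] = None,
) -> FrozenSet[int]:
    """Sketch of an explore-then-commit randomized double greedy.

    `play(S)` is a hypothetical environment call: it plays subset S for one
    round and returns a single noisy scalar reward (full-bandit feedback).
    """
    rng = rng or random.Random()
    rounds_used = 0

    def estimate(subset: FrozenSet[int]) -> float:
        # Average repeated plays of the same subset to estimate its mean reward.
        nonlocal rounds_used
        total = 0.0
        for _ in range(samples_per_set):
            total += play(subset)
        rounds_used += samples_per_set
        return total / samples_per_set

    x: FrozenSet[int] = frozenset()          # grows from the empty set
    y: FrozenSet[int] = frozenset(range(n))  # shrinks from the full set

    for arm in range(n):
        # Estimated gain of adding `arm` to x and of removing it from y.
        gain_add = estimate(x | {arm}) - estimate(x)
        gain_remove = estimate(y - {arm}) - estimate(y)
        a = max(gain_add, 0.0)
        b = max(gain_remove, 0.0)
        # Add with probability a / (a + b); add by convention when both are zero.
        p_add = 1.0 if a + b == 0.0 else a / (a + b)
        if rng.random() < p_add:
            x = x | {arm}
        else:
            y = y - {arm}

    # After the last arm, x and y coincide; commit to that set for the
    # remaining rounds (assumes horizon >= 4 * n * samples_per_set).
    for _ in range(max(0, horizon - rounds_used)):
        play(x)
    return x
```

In this sketch the exploration phase costs 4 · n · `samples_per_set` rounds, so `samples_per_set` controls the trade-off between estimation accuracy and the number of rounds left for exploitation.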


research
∙ 05/21/2023

Bandit Multi-linear DR-Submodular Maximization and Its Applications on Adversarial Submodular Bandits

We investigate the online bandit learning of the monotone multi-linear D...
research
∙ 03/23/2023

Stochastic Submodular Bandits with Delayed Composite Anonymous Bandit Feedback

This paper investigates the problem of combinatorial multiarmed bandits ...
research
∙ 12/03/2021

On Submodular Contextual Bandits

We consider the problem of contextual bandits where actions are subsets ...
research
∙ 01/30/2023

A Framework for Adapting Offline Algorithms to Solve Combinatorial Multi-Armed Bandit Problems with Bandit Feedback

We investigate the problem of stochastic, combinatorial multi-armed band...
research
∙ 01/30/2021

Recurrent Submodular Welfare and Matroid Blocking Bandits

A recent line of research focuses on the study of the stochastic multi-a...
research
∙ 05/22/2023

Bandit Submodular Maximization for Multi-Robot Coordination in Unpredictable and Partially Observable Environments

We study the problem of multi-agent coordination in unpredictable and pa...
research
∙ 07/14/2022

Influential Billboard Slot Selection using Pruned Submodularity Graph

Billboard Advertisement has emerged as an effective out-of-home advertis...
