Graph Neural Bandits

08/21/2023
by Yunzhe Qi, et al.

Contextual bandit algorithms aim to choose the optimal arm, the one with the highest reward, from a set of candidates based on contextual information. Various bandit algorithms have been applied to real-world applications because of their ability to tackle the exploitation-exploration dilemma. Motivated by online recommendation scenarios, in this paper we propose a framework named Graph Neural Bandits (GNB) that uses graph neural networks (GNNs) to leverage the collaborative nature among users. Instead of estimating rigid user clusters as in existing works, we model the "fine-grained" collaborative effects through estimated user graphs, one for exploitation and one for exploration. Then, to refine the recommendation strategy, we apply separate GNN-based models to the estimated user graphs for exploitation and adaptive exploration. Theoretical analysis and experimental results on multiple real data sets, in comparison with state-of-the-art baselines, are provided to demonstrate the effectiveness of our proposed framework.
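To make the idea of a soft user graph concrete, the following is a minimal sketch and not the GNB algorithm itself. It assumes each user has a hypothetical preference vector, estimates graph weights from cosine similarity, replaces the GNN-based exploitation model with a similarity-weighted average of linear reward estimates, and replaces the adaptive exploration model with a simple count-based bonus. All names (`user_graph_weights`, `select_arm`, `pull_counts`, `c`) are illustrative assumptions.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    nu, nv = math.sqrt(dot(u, u)), math.sqrt(dot(v, v))
    return dot(u, v) / (nu * nv) if nu and nv else 0.0


def user_graph_weights(target, user_vectors):
    """Soft edge weights from `target` to every user (including itself).

    Unlike hard clustering, every user contributes in proportion to
    similarity. Negative similarities are clipped to zero (an assumption).
    """
    sims = {u: max(cosine(user_vectors[target], vec), 0.0)
            for u, vec in user_vectors.items()}
    total = sum(sims.values())
    if total == 0:
        # No usable similarity: fall back to the target user alone.
        return {u: 1.0 if u == target else 0.0 for u in user_vectors}
    return {u: s / total for u, s in sims.items()}


def select_arm(target, user_vectors, arms, pull_counts, t, c=1.0):
    """Pick the arm maximizing exploitation score plus exploration bonus."""
    weights = user_graph_weights(target, user_vectors)
    best_arm, best_score = None, -math.inf
    for arm_id, context in arms.items():
        # Exploitation: neighbors' estimated rewards, weighted by the graph.
        exploit = sum(w * dot(user_vectors[u], context)
                      for u, w in weights.items())
        # Exploration: count-based bonus, a stand-in for a learned model.
        explore = c * math.sqrt(math.log(t + 1) / (pull_counts.get(arm_id, 0) + 1))
        if exploit + explore > best_score:
            best_arm, best_score = arm_id, exploit + explore
    return best_arm


users = {"alice": [1.0, 0.0], "bob": [0.9, 0.1], "carol": [0.0, 1.0]}
arms = {"a": [1.0, 0.0], "b": [0.0, 1.0]}
choice = select_arm("alice", users, arms, pull_counts={}, t=1, c=0.0)
```

With the exploration weight `c` set to zero, the choice is driven only by the similarity-weighted estimates; raising `c` shifts the selection toward arms that have been pulled less often.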


research
06/08/2022

Neural Bandit with Arm Group Graph

Contextual bandits aim to identify among a set of arms the optimal one w...
research
10/07/2021

EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits

Contextual multi-armed bandits have been studied for decades and adapted...
research
04/02/2020

Hierarchical Adaptive Contextual Bandits for Resource Constraint based Recommendation

Contextual multi-armed bandit (MAB) achieves cutting-edge performance on...
research
10/12/2022

Maximum entropy exploration in contextual bandits with neural networks and energy based models

Contextual bandits can solve a huge range of real-world problems. Howeve...
research
06/28/2020

Kernel Density Estimation based Factored Relevance Model for Multi-Contextual Point-of-Interest Recommendation

An automated contextual suggestion algorithm is likely to recommend cont...
research
06/25/2021

Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits

Contextual Bandits find important use cases in various real-life scenari...
research
11/19/2017

Estimation Considerations in Contextual Bandits

Contextual bandit algorithms seek to learn a personalized treatment assi...
