research
∙
07/15/2023
On the Robustness of Epoch-Greedy in Multi-Agent Contextual Bandit Mechanisms
Efficient learning in multi-armed bandit mechanisms such as pay-per-clic...
research
∙
05/18/2023
Black-Box Targeted Reward Poisoning Attack Against Online Deep Reinforcement Learning
We propose the first black-box targeted attack against online deep reinf...
research
∙
05/30/2022