research
∙
04/10/2022
Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations
Contextual bandits are canonical models for sequential decision-making u...
research
∙
02/02/2022
Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts
Contextual bandits are widely-used in the study of learning-based contro...
research
∙
10/23/2021