research
          
      
      ∙
      07/30/2021
    Indexability and Rollout Policy for Multi-State Partially Observable Restless Bandits
Restless multi-armed bandits with partially observable states has applic...
          
            research
          
      
      ∙
      02/08/2021
    Monte Carlo Rollout Policy for Recommendation Systems with Dynamic User Behavior
We model online recommendation systems using the hidden Markov multi-sta...
          
            research
          
      
      ∙
      07/25/2020
    Simulation Based Algorithms for Markov Decision Processes and Multi-Action Restless Bandits
We consider multi-dimensional Markov decision processes and formulate a ...
          
            research
          
      
      ∙
      10/04/2019
    Online repeated posted price auctions with a demand side platform
We consider an online ad network problem in which an ad exchange auction...
          
            research
          
      
      ∙
      04/18/2019
    Sequential Decision Making under Uncertainty with Dynamic Resource Constraints
This paper studies a class of constrained restless multi-armed bandits. ...
          
            research
          
      
      ∙
      01/04/2018