research
          
      
      ∙
      01/29/2023
    Recommender system as an exploration coordinator: a bounded O(1) regret algorithm for large platforms
On typical modern platforms, users are only able to try a small fraction...
          
            research
          
      
      ∙
      05/29/2019
    Learning scalable and transferable multi-robot/machine sequential assignment planning via graph embedding
Can the success of reinforcement learning methods for simple combinatori...
          
            research
          
      
      ∙
      05/29/2019
     
             
  
  
     
                             share
 share