
Multinomial Logit Contextual Bandits: Provable Optimality and Practicality
We consider a sequential assortment selection problem where the user cho...
Online Allocation of Reusable Resources via Algorithms Guided by Fluid Approximations
We consider the problem of online allocation (matching and assortments) ...
Sparsity-Agnostic Lasso Bandit
We consider a stochastic contextual bandit problem where the dimension d...
Sequential Anomaly Detection using Inverse Reinforcement Learning
One of the most interesting application scenarios in anomaly detection i...
Online Allocation of Reusable Resources: Achieving Optimal Competitive Ratio
We study the problem of allocating a given set of resources to sequentia...
Directed Exploration in PAC Model-Free Reinforcement Learning
We study an exploration method for model-free RL that generalizes the co...
Attainment Ratings for Graph-Query Recommendation
The video game industry is larger than both the film and music industrie...
Robust Implicit Backpropagation
Arguably the biggest challenge in applying neural networks is tuning the...
Passive Static Equilibrium with Frictional Contacts and Application to Grasp Stability Analysis
This paper studies the problem of passive grasp stability under an exter...
Unbiased scalable softmax optimization
Recent neural network and language models rely on softmax distributions ...
Passive Reaction Analysis for Grasp Stability
In this paper we focus on the following problem in multi-fingered roboti...
Unbiased Simulation for Optimizing Stochastic Function Compositions
In this paper, we introduce unbiased gradient simulation algorithms f...
Garud Iyengar
