
Multinomial Logit Contextual Bandits: Provable Optimality and Practicality
We consider a sequential assortment selection problem where the user cho...
read it

Online Allocation of Reusable Resources via Algorithms Guided by Fluid Approximations
We consider the problem of online allocation (matching and assortments) ...
read it

SparsityAgnostic Lasso Bandit
We consider a stochastic contextual bandit problem where the dimension d...
read it

Sequential Anomaly Detection using Inverse Reinforcement Learning
One of the most interesting application scenarios in anomaly detection i...
read it

Online Allocation of Reusable Resources: Achieving Optimal Competitive Ratio
We study the problem of allocating a given set of resources to sequentia...
read it

Directed Exploration in PAC ModelFree Reinforcement Learning
We study an exploration method for modelfree RL that generalizes the co...
read it

Attainment Ratings for GraphQuery Recommendation
The video game industry is larger than both the film and music industrie...
read it

Robust Implicit Backpropagation
Arguably the biggest challenge in applying neural networks is tuning the...
read it

Passive Static Equilibrium with Frictional Contacts and Application to Grasp Stability Analysis
This paper studies the problem of passive grasp stability under an exter...
read it

Unbiased scalable softmax optimization
Recent neural network and language models rely on softmax distributions ...
read it

Passive Reaction Analysis for Grasp Stability
In this paper we focus on the following problem in multifingered roboti...
read it

Unbiased Simulation for Optimizing Stochastic Function Compositions
In this paper, we introduce an unbiased gradient simulation algorithms f...
read it
Garud Iyengar
is this you? claim profile