
Reinforcement Learning with Convex Constraints
In standard reinforcement learning (RL), a learning agent seeks to optim...
read it

Constrained episodic reinforcement learning in concaveconvex and knapsack settings
We propose an algorithm for tabular episodic reinforcement learning with...
read it

ArbitrageFree Combinatorial Market Making via Integer Programming
We present a new combinatorial market maker that operates arbitragefree...
read it

Budget Constraints in Prediction Markets
We give a detailed characterization of optimal trades under budget const...
read it

Doubly Robust Policy Evaluation and Optimization
We study sequential decision making in environments where rewards are on...
read it

Offpolicy evaluation for slate recommendation
This paper studies the evaluation of policies that recommend an ordered ...
read it

Market Making with Decreasing Utility for Information
We study information elicitation in costfunctionbased combinatorial pr...
read it

Convex Risk Minimization and Conditional Probability Estimation
This paper proves, in very general settings, that convex risk minimizati...
read it

Contextual Semibandits via Supervised Learning Oracles
We study an online decision making problem where on each round a learner...
read it

FirstOrder Mixed Integer Linear Programming
Mixed integer linear programming (MILP) is a powerful representation oft...
read it

Sampleefficient Nonstationary Policy Evaluation for Contextual Bandits
We present and prove properties of a new offline policy evaluator for an...
read it

A Reliable Effective Terascale Linear Learning System
We present a system and a set of techniques for learning linear predicto...
read it

Efficient Optimal Learning for Contextual Bandits
We address the problem of learning in an online setting where the learne...
read it

Doubly Robust Policy Evaluation and Learning
We study decision making in environments where the reward is only partia...
read it

Practical Contextual Bandits with Regression Oracles
A major challenge in contextual bandits is to design generalpurpose alg...
read it

A Reductions Approach to Fair Classification
We present a systematic approach for achieving fairness in a binary clas...
read it

Hierarchical Imitation and Reinforcement Learning
We study the problem of learning policies over long time horizons. We pr...
read it

Provably efficient RL with Rich Observations via Latent State Decoding
We study the exploration problem in episodic MDPs with rich observations...
read it

Fair Regression: Quantitative Definitions and Reductionbased Algorithms
In this paper, we study the prediction of a realvalued target, such as ...
read it

Doubly robust offpolicy evaluation with shrinkage
We design a new family of estimators for offpolicy evaluation in contex...
read it

Gradient descent follows the regularization path for general losses
Recent work across many machine learning disciplines has highlighted tha...
read it