Miroslav Dudík

research

∙ 06/09/2023

A Unified Model and Dimension for Interactive Estimation

We study an abstract framework for interactive learning called interacti...

0 Nataly Brukhim, et al. ∙

research

∙ 03/29/2023

Fairlearn: Assessing and Improving Fairness of AI Systems

Fairlearn is an open source project to help practitioners assess and imp...

1 Hilde Weerts, et al. ∙

research

∙ 05/06/2022

Convex Analysis at Infinity: An Introduction to Astral Space

Not all convex functions on ℝ^n have finite minimizers; some can only be...

0 Miroslav Dudík, et al. ∙

research

∙ 02/10/2022

Personalization Improves Privacy-Accuracy Tradeoffs in Federated Optimization

Large-scale machine learning systems often involve data distributed acro...

11 Alberto Bietti, et al. ∙

research

∙ 07/03/2021

Bayesian decision-making under misspecified priors with applications to meta-learning

Thompson sampling and other Bayesian sequential decision-making algorith...

13 Max Simchowitz, et al. ∙

research

∙ 02/15/2021

Log-time Prediction Markets for Interval Securities

We design a prediction market to recover a complete and fully general pr...

0 Miroslav Dudík, et al. ∙

research

∙ 06/19/2020

Gradient descent follows the regularization path for general losses

Recent work across many machine learning disciplines has highlighted tha...

0 Ziwei Ji, et al. ∙

research

∙ 06/09/2020

Constrained episodic reinforcement learning in concave-convex and knapsack settings

We propose an algorithm for tabular episodic reinforcement learning with...

8 Kianté Brantley, et al. ∙

research

∙ 07/22/2019

Doubly robust off-policy evaluation with shrinkage

We design a new family of estimators for off-policy evaluation in contex...

0 Yi Su, et al. ∙

research

∙ 06/21/2019

Reinforcement Learning with Convex Constraints

In standard reinforcement learning (RL), a learning agent seeks to optim...

14 Sobhan Miryoosefi, et al. ∙

research

∙ 05/30/2019

Fair Regression: Quantitative Definitions and Reduction-based Algorithms

In this paper, we study the prediction of a real-valued target, such as ...

0 Alekh Agarwal, et al. ∙

research

∙ 01/25/2019

Provably efficient RL with Rich Observations via Latent State Decoding

We study the exploration problem in episodic MDPs with rich observations...

0 Simon S. Du, et al. ∙

research

∙ 03/06/2018

A Reductions Approach to Fair Classification

We present a systematic approach for achieving fairness in a binary clas...

0 Alekh Agarwal, et al. ∙

research

∙ 03/03/2018

Practical Contextual Bandits with Regression Oracles

A major challenge in contextual bandits is to design general-purpose alg...

0 Dylan J. Foster, et al. ∙

research

∙ 03/01/2018

Hierarchical Imitation and Reinforcement Learning

We study the problem of learning policies over long time horizons. We pr...

0 Hoang M. Le, et al. ∙

research

∙ 06/09/2016

Arbitrage-Free Combinatorial Market Making via Integer Programming

We present a new combinatorial market maker that operates arbitrage-free...

0 Christian Kroer, et al. ∙

research

∙ 05/16/2016

Off-policy evaluation for slate recommendation

This paper studies the evaluation of policies that recommend an ordered ...

0 Adith Swaminathan, et al. ∙

research

∙ 10/07/2015

Budget Constraints in Prediction Markets

We give a detailed characterization of optimal trades under budget const...

0 Nikhil Devanur, et al. ∙

research

∙ 06/15/2015

Convex Risk Minimization and Conditional Probability Estimation

This paper proves, in very general settings, that convex risk minimizati...

0 Matus Telgarsky, et al. ∙

research

∙ 03/10/2015

Doubly Robust Policy Evaluation and Optimization

We study sequential decision making in environments where rewards are on...

0 Miroslav Dudík, et al. ∙

research

∙ 02/20/2015

Contextual Semibandits via Supervised Learning Oracles

We study an online decision making problem where on each round a learner...

0 Akshay Krishnamurthy, et al. ∙

research

∙ 07/30/2014

Market Making with Decreasing Utility for Information

We study information elicitation in cost-function-based combinatorial pr...

0 Miroslav Dudík, et al. ∙

research

∙ 10/16/2012

Sample-efficient Nonstationary Policy Evaluation for Contextual Bandits

We present and prove properties of a new offline policy evaluator for an...

0 Miroslav Dudík, et al. ∙

research

∙ 05/09/2012

First-Order Mixed Integer Linear Programming

Mixed integer linear programming (MILP) is a powerful representation oft...

0 Geoffrey Gordon, et al. ∙

research

∙ 10/19/2011

A Reliable Effective Terascale Linear Learning System

We present a system and a set of techniques for learning linear predicto...

0 Alekh Agarwal, et al. ∙

research

∙ 06/13/2011

Efficient Optimal Learning for Contextual Bandits

We address the problem of learning in an online setting where the learne...

0 Miroslav Dudík, et al. ∙

research

∙ 03/23/2011

Doubly Robust Policy Evaluation and Learning

We study decision making in environments where the reward is only partia...

0 Miroslav Dudík, et al. ∙

Miroslav Dudík

Featured Co-authors

Sign in with Google

Consider DeepAI Pro