Eric Mazumdar

research

∙ 07/03/2023

Coupled Gradient Flows for Strategic Non-Local Distribution Shift

We propose a novel framework for analyzing the dynamics of distribution ...

0 Lauren Conger, et al. ∙

research

∙ 03/03/2023

A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

We study two-player zero-sum stochastic games, and propose a form of ind...

0 Zaiwei Chen, et al. ∙

research

∙ 02/08/2023

Algorithmic Collective Action in Machine Learning

We initiate a principled study of algorithmic collective action on digit...

0 Moritz Hardt, et al. ∙

research

∙ 02/02/2023

Convergent First-Order Methods for Bi-level Optimization and Stackelberg Games

We propose an algorithm to solve a class of bi-level optimization proble...

0 Chinmay Maheshwari, et al. ∙

research

∙ 08/02/2022

A Note on Zeroth-Order Optimization on the Simplex

We construct a zeroth-order gradient estimator for a smooth function def...

0 Tijana Zrnic, et al. ∙

research

∙ 06/22/2022

Langevin Monte Carlo for Contextual Bandits

We study the efficiency of Thompson sampling for contextual bandits. Exi...

23 Pan Xu, et al. ∙

research

∙ 06/06/2022

Decentralized, Communication- and Coordination-free Learning in Structured Matching Markets

We study the problem of online learning in competitive settings in the c...

0 Chinmay Maheshwari, et al. ∙

research

∙ 06/23/2021

Who Leads and Who Follows in Strategic Classification?

As predictive models are deployed into the real world, they must increas...

9 Tijana Zrnic, et al. ∙

research

∙ 06/16/2021

Zeroth-Order Methods for Convex-Concave Minmax Problems: Applications to Decision-Dependent Risk Minimization

Min-max optimization is emerging as a key framework for analyzing proble...

5 Chinmay Maheshwari, et al. ∙

research

∙ 04/27/2021

Fast Distributionally Robust Learning with Variance Reduced Min-Max Optimization

Distributionally robust supervised learning (DRSL) is emerging as a key ...

18 Yaodong Yu, et al. ∙

research

∙ 10/26/2020

Expert Selection in High-Dimensional Markov Decision Processes

In this work we present a multi-armed bandit framework for online expert...

8 Vicenc Rubies Royo, et al. ∙

research

∙ 04/06/2020

Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning

This paper proposes a framework for adaptively learning a feedback linea...

5 Tyler Westenbroek, et al. ∙

research

∙ 02/23/2020

On Thompson Sampling with Langevin Algorithms

Thompson sampling is a methodology for multi-armed bandit problems that ...

9 Eric Mazumdar, et al. ∙

research

∙ 02/03/2020

Local Nash Equilibria are Isolated, Strict Local Nash Equilibria in 'Almost All' Zero-Sum Continuous Games

We prove that differential Nash equilibria are generic amongst local Nas...

0 Eric Mazumdar, et al. ∙

research

∙ 10/29/2019

Feedback Linearization for Unknown Systems via Reinforcement Learning

We present a novel approach to control design for nonlinear systems, whi...

1 Tyler Westenbroek, et al. ∙

research

∙ 07/08/2019

Policy-Gradient Algorithms Have No Guarantees of Convergence in Continuous Action and State Multi-Agent Settings

We show by counterexample that policy-gradient algorithms have no guaran...

1 Eric Mazumdar, et al. ∙

research

∙ 05/30/2019

Convergence Analysis of Gradient-Based Learning with Non-Uniform Learning Rates in Non-Cooperative Multi-Agent Settings

Considering a class of gradient-based multi-agent learning algorithms in...

0 Benjamin Chasnov, et al. ∙

research

∙ 04/16/2018

On the Convergence of Competitive, Multi-Agent Gradient-Based Learning

As learning algorithms are increasingly deployed in markets and other co...

0 Eric Mazumdar, et al. ∙

research

∙ 03/29/2017

Inverse Risk-Sensitive Reinforcement Learning

We address the problem of inverse reinforcement learning in Markov decis...

0 Lillian J. Ratliff, et al. ∙
