In this paper, we propose three generic models of capacitated coverage a...
Matrix product codes are generalizations of some well-known construction...
Off-policy evaluation and learning are concerned with assessing a given
...
We present a scalable and effective exploration strategy based on Thomps...
We present Queer in AI as a case study for community-led participatory d...
We study a multi-agent reinforcement learning (MARL) problem where the a...
There are many news articles reporting the obstacles confronting
poverty...
In this paper, we propose an online-matching-based model to study the
as...
We study the efficiency of Thompson sampling for contextual bandits. Exi...
We study the regret of Thompson sampling (TS) algorithms for exponential...
We consider Online Minimum Bipartite Matching under the uniform metric. ...
We consider online resource allocation under a typical non-profit settin...
In heterogeneous rank aggregation problems, users often exhibit various
...
Matching markets involve heterogeneous agents (typically from two partie...
Several scientific studies have reported the existence of the income gap...
With the popularity of the Internet, traditional offline resource alloca...
We study a general class of contextual bandits, where each context-actio...
In typical online matching problems, the goal is to maximize the number ...
We establish a new convergence analysis of stochastic gradient Langevin
...
We consider the Stochastic Matching problem, which is motivated by
appli...
Actor-critic (AC) methods have exhibited great empirical success compare...
Thompson sampling is one of the most widely used algorithms for many onl...
We study the two-armed bandit problem with subGaussian rewards. The
expl...
Rideshare platforms, when assigning requests to drivers, tend to maximiz...
Q-learning with neural network function approximation (neural Q-learning...
We propose the Heterogeneous Thurstone Model (HTM) for aggregating ranke...
Rideshare platforms such as Uber and Lyft dynamically dispatch drivers t...
Improving the sample efficiency in reinforcement learning has been a
lon...
We revisit the stochastic variance-reduced policy gradient (SVRPG) metho...
We propose a sample efficient stochastic variance-reduced cubic
regulari...
In bipartite matching problems, vertices on one side of a bipartite grap...
We propose two algorithms that can find local minima faster than the
sta...
We study finite-sum nonconvex optimization problems, where the objective...
We develop a framework for obtaining polynomial time approximation schem...
Online matching problems have garnered significant attention in recent y...
We propose a stochastic variance-reduced cubic regularized Newton method...
We propose a fast stochastic Hamilton Monte Carlo (HMC) method, for samp...
Space filling curves (SFCs) are widely used in the design of indexes for...
We propose stochastic optimization algorithms that can find local minima...
Bipartite matching markets pair agents on one side of a market with agen...
Column-sparse packing problems arise in several contexts in both
determi...
We present a unified framework to analyze the global convergence of Lang...
We study the estimation of the latent variable Gaussian graphical model
...
We propose communication-efficient distributed estimation and inference
...