We introduce TeraHAC, a (1+ϵ)-approximate hierarchical agglomerative
clu...
Estimating causal effects from randomized experiments is only feasible i...
We study the classic Euclidean Minimum Spanning Tree (MST) problem in th...
In online platforms, the impact of a treatment on an observed outcome ma...
In this paper, we study the setting in which data owners train machine
l...
Protecting user privacy is a major concern for many machine learning sys...
In this paper, we study the stochastic linear bandit problem under the
a...
Compact user representations (such as embeddings) form the backbone of
p...
We design learning rate schedules that minimize regret for SGD-based onl...
In fully dynamic clustering problems, a clustering of a given data set i...
In this paper, we design replicable algorithms in the context of statist...
Budget pacing is a popular service that has been offered by major intern...
This work studies the combinatorial optimization problem of finding an
o...
Major Internet advertising platforms offer budget pacing tools as a stan...
In digital online advertising, advertisers procure ad impressions
simult...
We study autobidding ad auctions with user costs, where each bidder is
v...
Hierarchical Clustering is a popular unsupervised machine learning metho...
The streaming model of computation is a popular approach for working wit...
We study the problem of graph clustering under a broad class of objectiv...
A fundamental procedure in the analysis of massive datasets is the
const...
The conclusions of randomized controlled trials may be biased when the
o...
In this work, we present and study a new framework for online learning i...
In this paper, we introduce the notion of reproducible policies in the
c...
Feature selection is the problem of selecting a subset of features for a...
The increasing availability of real-time data has fueled the prevalence ...
We study the price of anarchy of the first-price auction in the autobidd...
Personalized PageRank (PPR) is a fundamental tool in unsupervised learni...
When working with user data providing well-defined privacy guarantees is...
We study the private k-median and k-means clustering problem in d
dimens...
Representative selection (RS) is the problem of finding a small subset o...
Motivated by data analysis and machine learning applications, we conside...
We study a family of first-order methods with momentum based on mirror
d...
We present the first algorithm for fully dynamic k-centers clustering in...
We investigate the optimal design of experimental studies that have
pre-...
In classic auction theory, reserve prices are known to be effective for
...
Streaming computation plays an important role in large-scale data analys...
In this work, we study high-dimensional mean estimation under user-level...
We present new mechanisms for label differential privacy, a relaxation
o...
Graph clustering and community detection are central problems in modern ...
In online advertising markets, setting budget and return on investment (...
Recently, due to an increasing interest for transparency in artificial
i...
We study the widely used hierarchical agglomerative clustering (HAC)
alg...
In this paper, we study the r-gather problem, a natural formulation of
m...
How can we make use of information parallelism in online decision making...
Auto-bidding has become one of the main options for bidding in online
ad...
The bipartite experimental framework is a recently proposed causal setti...
In many sequential decision-making problems, the individuals are split i...
Online allocation problems with resource constraints are central problem...
A soft-max function has two main efficiency measures: (1) approximation ...
Unlike nonconvex optimization, where gradient descent is guaranteed to
c...