Answering database queries while preserving privacy is an important prob...
Data sketching is a critical tool for distinct counting, enabling multis...
We initiate the study of zero-knowledge proofs for data streams. Streami...
We consider the federated frequency estimation problem, where each user ...
We address two major obstacles to practical use of supervised classifier...
There is great demand for scalable, secure, and efficient privacy-preser...
Many applications seek to produce differentially private statistics on
s...
With the recent bloom of data, there is a huge surge in threats against
...
Advances in communications, storage and computational technology allow
s...
Joining records with all other records that meet a linkage condition can...
Federated analytics relies on the collection of accurate statistics abou...
In this work we introduce a new protocol for vector aggregation in the
c...
The vulnerability of machine learning models to membership inference att...
We introduce Opacus, a free, open-source PyTorch library for training de...
Sharing sensitive data is vital in enabling many modern data analysis an...
Sharing trajectories is beneficial for many real-world applications, suc...
Private data generated by edge devices – from smart phones to automotive...
We study the fundamental problem of frequency estimation under both priv...
Private collection of statistics from a large distributed population is ...
Estimating the distribution and quantiles of data is a foundational task...
Given an n × d dimensional dataset A, a projection query specifies a
sub...
Approximating ranks, quantiles, and distributions over streaming data is...
Federated learning (FL) is a machine learning setting where many clients...
Parameterized complexity attempts to give a more fine-grained analysis o...
Scalable algorithms to solve optimization and regression tasks even
appr...
Problems involving the efficient arrangement of simple objects, as captu...
Quantiles, such as the median or percentiles, provide concise and useful...
Counting the fraction of a population having an input within a specified...
Clustering is a fundamental tool for analyzing large data sets. A rich b...
We consider the classic maximal and maximum independent set problems in ...
Work on approximate linear algebra has led to efficient distributed and
...
Many analysis and machine learning tasks require the availability of mar...
A current challenge for data management systems is to support the
constr...
Concern about how to aggregate sensitive user data without compromising
...