Data sketching is a critical tool for distinct counting, enabling multis...
We consider privacy in the context of streaming algorithms for cardinali...
Deep neural networks (DNNs) trained on one set of medical images often
e...
Consider the fundamental problem of drawing a simple random sample of si...
Counting the number of distinct elements on a set is needed in many
appl...
Nonlinear dimensionality reduction methods provide a valuable means to
v...
Bloom filters, cuckoo filters, and other approximate set membership sket...
Count distinct or cardinality estimates are widely used in network monit...
The Count-Min sketch is an important and well-studied data summarization...
We develop theory for nonlinear dimensionality reduction (NLDR). A numbe...
Hypothesis testing and other statistical inference procedures are most
e...
We introduce and study a new data sketch for processing massive datasets...
Sub-sampling is a common and often effective method to deal with the
com...
Sampling is a fundamental problem in both computer science and statistic...
In this paper, we tackle the problem of online semi-supervised learning
...
Existing approaches to analyzing the asymptotics of graph Laplacians
typ...