Improved Algorithms for Time Decay Streams

07/17/2019
by   Vladimir Braverman, et al.
0

In the time-decay model for data streams, elements of an underlying data set arrive sequentially with the recently arrived elements being more important. A common approach for handling large data sets is to maintain a coreset, a succinct summary of the processed data that allows approximate recovery of a predetermined query. We provide a general framework that takes any offline-coreset and gives a time-decay coreset for polynomial time decay functions. We also consider the exponential time decay model for k-median clustering, where we provide a constant factor approximation algorithm that utilizes the online facility location algorithm. Our algorithm stores O(k(hΔ)+h) points where h is the half-life of the decay function and Δ is the aspect ratio of the dataset. Our techniques extend to k-means clustering and M-estimators as well.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/25/2020

Approximation Algorithms for Clustering with Dynamic Points

In many classic clustering problems, we seek to sketch a massive data se...
research
09/16/2018

Constant factor FPT approximation for capacitated k-median

Capacitated k-median is one of the few outstanding optimization problems...
research
11/26/2017

Online Facility Location on Semi-Random Streams

In the streaming model, the order of the stream can significantly affect...
research
12/04/2022

Clustering Permutations: New Techniques with Streaming Applications

We study the classical metric k-median clustering problem over a set of ...
research
11/13/2020

Consistent k-Clustering for General Metrics

Given a stream of points in a metric space, is it possible to maintain a...
research
06/11/2019

Temporally-Biased Sampling Schemes for Online Model Management

To maintain the accuracy of supervised learning models in the presence o...
research
11/16/2017

Fast ordered sampling of DNA sequence variants

Explosive growth in the amount of genomic data is matched by increasing ...

Please sign up or login with your details

Forgot password? Click here to reset