Age-Partitioned Bloom Filters

01/09/2020
by   Ariel Shtul, et al.

Bloom filters (BFs) are widely used for approximate membership queries over a set of elements. BF variants allow removals, sets of unbounded size, or querying a sliding window over an unbounded stream. However, for this last case the best current approaches are dictionary based (e.g., based on Cuckoo Filters or TinyTable), and it may seem that BF-based approaches will never be competitive with dictionary-based ones. In this paper we present Age-Partitioned Bloom Filters (APBFs), a BF-based approach for duplicate detection in sliding windows that is not only competitive in time complexity but also uses less space than current dictionary-based approaches (e.g., SWAMP), at the cost of some moderate slack. Unlike dictionary-based approaches, APBFs retain the simplicity of BFs, which is important for hardware-based implementations, and they can integrate known improvements such as double hashing or blocking. We present an Age-Partitioned Blocked Bloom Filter variant that can operate with 2-3 cache-line accesses per insertion and around 2-4 per query, even for high-accuracy filters.
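
The abstract does not spell out the mechanism, so the following is only a minimal sketch of one way an age-partitioned filter could work, under stated assumptions: the filter is a ring of k + l slices of m bits each, an insertion sets one bit in each of the k youngest slices, the oldest slice is cleared and recycled as the youngest after every g insertions, and a query reports a hit when k consecutive slices all have the item's bit set. The class name, the parameter names (k, l, m, g), and the choice of BLAKE2b with double hashing are illustrative and are not taken from the page above.

```python
import hashlib


class AgePartitionedBloomSketch:
    """Toy sliding-window filter: k + l slices of m bits, shifted every g inserts."""

    def __init__(self, k, l, m, g):
        self.k, self.l, self.m, self.g = k, l, m, g
        self.n = k + l                      # total number of slices
        # One byte per bit keeps the sketch readable (not space-efficient).
        self.slices = [bytearray(m) for _ in range(self.n)]
        self.base = 0                       # physical index of the youngest slice
        self.count = 0                      # insertions in the current generation

    def _pos(self, item, phys):
        # Double hashing (assumed): one digest yields h1 and h2, and the
        # physical slice index selects the position h1 + phys * h2 mod m.
        d = hashlib.blake2b(item.encode("utf-8"), digest_size=16).digest()
        h1 = int.from_bytes(d[:8], "little")
        h2 = int.from_bytes(d[8:], "little") | 1
        return (h1 + phys * h2) % self.m

    def _shift(self):
        # The oldest slice is cleared and becomes the new youngest slice.
        self.base = (self.base - 1) % self.n
        self.slices[self.base] = bytearray(self.m)
        self.count = 0

    def add(self, item):
        if self.count == self.g:
            self._shift()
        for i in range(self.k):             # set a bit in the k youngest slices
            phys = (self.base + i) % self.n
            self.slices[phys][self._pos(item, phys)] = 1
        self.count += 1

    def query(self, item):
        run = 0                             # length of the current run of set bits
        for i in range(self.n):             # scan from youngest to oldest slice
            phys = (self.base + i) % self.n
            if self.slices[phys][self._pos(item, phys)]:
                run += 1
                if run == self.k:
                    return True
            else:
                run = 0
        return False


f = AgePartitionedBloomSketch(k=4, l=3, m=64, g=4)
f.add("a")
for i in range(15):
    f.add(f"x{i}")
print(f.query("a"))
```

In this sketch an inserted item keeps its k bits until the fourth shift (l + 1) after its insertion, so it has no false negatives inside a window of at least l * g insertions; items older than that may or may not still be reported, which is one plausible reading of the "moderate slack" mentioned in the abstract.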

research
09/24/2020

A Case for Partitioned Bloom Filters

In a partitioned Bloom Filter the m bit vector is split into k disjoint ...
research
09/14/2018

Approximate Query Processing over Static Sets and Sliding Windows

Indexing of static and dynamic sets is fundamental to a large set of app...
research
12/17/2019

Xor Filters: Faster and Smaller Than Bloom and Cuckoo Filters

The Bloom filter provides fast approximate set membership while using li...
research
11/12/2019

Fully-Dynamic Space-Efficient Dictionaries and Filters with Constant Number of Memory Accesses

A fully-dynamic dictionary is a data structure for maintaining sets that...
research
05/10/2020

Approaching Optimal Duplicate Detection in a Sliding Window

Duplicate detection is the problem of identifying whether a given item h...
research
06/11/2023

Time-limited Bloom Filter

A Bloom Filter is a probabilistic data structure designed to check, rapi...
research
08/25/2023

ChainedFilter: Combining Membership Filters by Chain Rule

Membership (membership query / membership testing) is a fundamental prob...
