ChainedFilter: Combining Membership Filters by Chain Rule

08/25/2023
by   Haoyu Li, et al.
0

Membership (membership query / membership testing) is a fundamental problem across databases, networks and security. However, previous research has primarily focused on either approximate solutions, such as Bloom Filters, or exact methods, like perfect hashing and dictionaries, without attempting to develop a an integral theory. In this paper, we propose a unified and complete theory, namely chain rule, for general membership problems, which encompasses both approximate and exact membership as extreme cases. Building upon the chain rule, we introduce a straightforward yet versatile algorithm framework, namely ChainedFilter, to combine different elementary filters without losing information. Our evaluation results demonstrate that ChainedFilter performs well in many applications: (1) it requires only 26 theoretical lower bound for implicit static dictionary, (2) it requires only 0.22 additional bit per item over the theoretical lower bound for lossless data compression, (3) it reduces up to 31 Hashing, (4) it reduces up to 36 Filter under the same space cost in RocksDB database, and (5) it reduces up to 99.1

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/05/2020

Conditional Cuckoo Filters

Bloom filters, cuckoo filters, and other approximate set membership sket...
research
10/07/2019

RAMBO: Repeated And Merged Bloom Filter for Multiple Set Membership Testing (MSMT) in Sub-linear time

Approximate set membership is a common problem with wide applications in...
research
12/17/2019

Xor Filters: Faster and Smaller Than Bloom and Cuckoo Filters

The Bloom filter provides fast approximate set membership while using li...
research
01/18/2018

NAE-SAT-based probabilistic membership filters

Probabilistic membership filters are a type of data structure designed t...
research
05/03/2020

A Dynamic Space-Efficient Filter with Constant Time Operations

A dynamic dictionary is a data structure that maintains sets of cardinal...
research
01/09/2020

Age-Partitioned Bloom Filters

Bloom filters (BF) are widely used for approximate membership queries ov...
research
07/16/2018

A Lyra2 FPGA Core for Lyra2REv2-Based Cryptocurrencies

Lyra2REv2 is a hashing algorithm that consists of a chain of individual ...

Please sign up or login with your details

Forgot password? Click here to reset