NAE-SAT-based probabilistic membership filters

01/18/2018
by Chao Fang et al.

Probabilistic membership filters are data structures designed to quickly test whether an element of a large data set belongs to a given subset of that data. False negatives are not possible, but false positives are. The main goal of any good probabilistic membership filter is therefore a small false-positive rate combined with memory efficiency and fast queries. Although Bloom filters are fast to construct, their memory efficiency is limited by a strict theoretical upper bound. Weaver et al. introduced random satisfiability-based filters that significantly improve the efficiency of probabilistic filters, but at the cost of solving a complex random satisfiability (SAT) formula when constructing the filter. Here we present an improved SAT filter approach that focuses on reducing both filter building times and query times. Our approach uses not-all-equal (NAE) SAT formulas to build the filters, solves them via a mapping to random SAT with traditionally fast random SAT solvers, and additionally applies bit packing and a reduction in the number of hash functions. Paired with fast hardware, NAE-SAT filters could enable enterprise-size applications.
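To make the idea concrete, the following is a minimal sketch of an NAE-SAT-style filter, not the authors' implementation. It assumes each element is hashed to one clause of k literals over distinct variables, uses the standard mapping of an NAE clause to two ordinary SAT clauses (the clause and its literal-wise negation), and replaces the fast random SAT solver with a brute-force search that is only practical for toy sizes. The names `clause_for`, `nae_to_sat`, `build_filter`, and `query` are hypothetical, and bit packing and the reduction of hash functions are not shown.

```python
import hashlib
import itertools


def clause_for(element, n_vars, k):
    """Hash an element to k literals over distinct variables.

    Each literal is a (variable_index, negated) pair. Requires k <= n_vars.
    The SHA-256-based hashing scheme is an assumption made for this sketch.
    """
    literals, used, counter = [], set(), 0
    while len(literals) < k:
        digest = hashlib.sha256(f"{element}:{counter}".encode()).digest()
        counter += 1
        var = int.from_bytes(digest[:4], "big") % n_vars
        if var in used:
            continue  # re-hash until the clause has k distinct variables
        used.add(var)
        literals.append((var, bool(digest[4] & 1)))
    return literals


def nae_to_sat(clause):
    """Map one NAE clause to two SAT clauses: itself and its negation."""
    return [clause, [(var, not neg) for var, neg in clause]]


def sat_clause_ok(clause, assignment):
    """A SAT clause holds if at least one of its literals is true."""
    return any(assignment[var] != neg for var, neg in clause)


def build_filter(elements, n_vars, k):
    """Search for an assignment satisfying every clause (brute force)."""
    cnf = [c for e in elements for c in nae_to_sat(clause_for(e, n_vars, k))]
    for bits in itertools.product([False, True], repeat=n_vars):
        if all(sat_clause_ok(c, bits) for c in cnf):
            return bits  # the stored assignment is the filter
    return None  # no solution found for this instance


def query(element, assignment, k):
    """Report 'possibly in the set' if the element's NAE clause holds."""
    clause = clause_for(element, len(assignment), k)
    return all(sat_clause_ok(c, assignment) for c in nae_to_sat(clause))
```

Every stored element passes `query` by construction, which is why false negatives cannot occur. For an element outside the set, a uniformly random clause of k literals on distinct variables is NAE-satisfied by a fixed assignment with probability 1 - 2^(1-k), so a single stored solution as in this sketch has a high false-positive rate on its own.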


