Counting Simplices in Hypergraph Streams

12/21/2021
by   Amit Chakrabarti, et al.
0

We consider the problem of space-efficiently estimating the number of simplices in a hypergraph stream. This is the most natural hypergraph generalization of the highly-studied problem of estimating the number of triangles in a graph stream. Our input is a k-uniform hypergraph H with n vertices and m hyperedges. A k-simplex in H is a subhypergraph on k+1 vertices X such that all k+1 possible hyperedges among X exist in H. The goal is to process a stream of hyperedges of H and compute a good estimate of T_k(H), the number of k-simplices in H. We design a suite of algorithms for this problem. Under a promise that T_k(H) ≥ T, our algorithms use at most four passes and together imply a space bound of O( ϵ^-2logδ^-1polylog n ·min{ m^1+1/k/T, m/T^2/(k+1)} ) for each fixed k ≥ 3, in order to guarantee an estimate within (1±ϵ)T_k(H) with probability at least 1-δ. We also give a simpler 1-pass algorithm that achieves O(ϵ^-2logδ^-1log n· (m/T) ( Δ_E + Δ_V^1-1/k )) space, where Δ_E (respectively, Δ_V) denotes the maximum number of k-simplices that share a hyperedge (respectively, a vertex). We complement these algorithmic results with space lower bounds of the form Ω(ϵ^-2), Ω(m^1+1/k/T), Ω(m/T^1-1/k) and Ω(mΔ_V^1/k/T) for multi-pass algorithms and Ω(mΔ_E/T) for 1-pass algorithms, which show that some of the dependencies on parameters in our upper bounds are nearly tight. Our techniques extend and generalize several different ideas previously developed for triangle counting in graphs, using appropriate innovations to handle the more complicated combinatorics of hypergraphs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/15/2018

The Sketching Complexity of Graph and Hypergraph Counting

Subgraph counting is a fundamental primitive in graph processing, with a...
research
03/29/2020

How the Degeneracy Helps for Triangle Counting in Graph Streams

We revisit the well-studied problem of triangle count estimation in grap...
research
06/29/2022

A note on the Tuza constant c_k for small k

For a hypergraph H, the transversal is a subset of vertices whose inters...
research
03/17/2022

Triangle and Four Cycle Counting with Predictions in Graph Streams

We propose data-driven one-pass streaming algorithms for estimating the ...
research
04/23/2020

Non-Adaptive Adaptive Sampling on Turnstile Streams

Adaptive sampling is a useful algorithmic tool for data summarization pr...
research
11/12/2020

Towards Tight Bounds for Spectral Sparsification of Hypergraphs

Cut and spectral sparsification of graphs have numerous applications, in...
research
12/08/2018

Counting Butterfies from a Large Bipartite Graph Stream

We consider the estimation of properties on massive bipartite graph stre...

Please sign up or login with your details

Forgot password? Click here to reset