The Sketching Complexity of Graph and Hypergraph Counting

08/15/2018
by   John Kallaugher, et al.
0

Subgraph counting is a fundamental primitive in graph processing, with applications in social network analysis (e.g., estimating the clustering coefficient of a graph), database processing and other areas. The space complexity of subgraph counting has been studied extensively in the literature, but many natural settings are still not well understood. In this paper we revisit the subgraph (and hypergraph) counting problem in the sketching model, where the algorithm's state as it processes a stream of updates to the graph is a linear function of the stream. This model has recently received a lot of attention in the literature, and has become a standard model for solving dynamic graph streaming problems. In this paper we give a tight bound on the sketching complexity of counting the number of occurrences of a small subgraph H in a bounded degree graph G presented as a stream of edge updates. Specifically, we show that the space complexity of the problem is governed by the fractional vertex cover number of the graph H. Our subgraph counting algorithm implements a natural vertex sampling approach, with sampling probabilities governed by the vertex cover of H. Our main technical contribution lies in a new set of Fourier analytic tools that we develop to analyze multiplayer communication protocols in the simultaneous communication model, allowing us to prove a tight lower bound. We believe that our techniques are likely to find applications in other settings. Besides giving tight bounds for all graphs H, both our algorithm and lower bounds extend to the hypergraph setting, albeit with some loss in space complexity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/21/2021

Counting Simplices in Hypergraph Streams

We consider the problem of space-efficiently estimating the number of si...
research
11/19/2018

A Simple Sublinear-Time Algorithm for Counting Arbitrary Subgraphs via Edge Sampling

In the subgraph counting problem, we are given a input graph G(V, E) and...
research
09/11/2020

2D Fractional Cascading on Axis-aligned Planar Subdivisions

Fractional cascading is one of the influential techniques in data struct...
research
05/04/2021

An Optimal Algorithm for Triangle Counting

We present a new algorithm for approximating the number of triangles in ...
research
05/18/2023

(Noisy) Gap Cycle Counting Strikes Back: Random Order Streaming Lower Bounds for Connected Components and Beyond

We continue the study of the communication complexity of gap cycle count...
research
09/16/2022

Asymptotically Optimal Bounds for Estimating H-Index in Sublinear Time with Applications to Subgraph Counting

The h-index is a metric used to measure the impact of a user in a public...
research
12/18/2017

Experimental Evaluation of Counting Subgraph Isomorphisms in Classes of Bounded Expansion

Counting subgraph isomorphisms (also called motifs or graphlets) has bee...

Please sign up or login with your details

Forgot password? Click here to reset