Almost Optimal Bounds for Sublinear-Time Sampling of k-Cliques: Sampling Cliques is Harder Than Counting

12/07/2020
by   Talya Eden, et al.
0

In this work, we consider the problem of sampling a k-clique in a graph from an almost uniform distribution in sublinear time in the general graph query model. Specifically the algorithm should output each k-clique with probability (1±ϵ)/n_k, where n_k denotes the number of k-cliques in the graph and ϵ is a given approximation parameter. We prove that the query complexity of this problem is Θ^*(max{((nα)^k/2/ n_k)^1/k-1 , min{nα,nα^k-1/n_k}}). where n is the number of vertices in the graph, α is its arboricity, and Θ^* suppresses the dependence on (log n/ϵ)^O(k). Interestingly, this establishes a separation between approximate counting and approximate uniform sampling in the sublinear regime. For example, if k=3, α = O(1), and n_3 (the number of triangles) is Θ(n), then we get a lower bound of Ω(n^1/4) (for constant ϵ), while under these conditions, a (1±ϵ)-approximation of n_3 can be obtained by performing poly(log(n/ϵ)) queries (Eden, Ron and Seshadhri, SODA20). Our lower bound follows from a construction of a family of graphs with arboricity α such that in each graph there are n_k cliques (of size k), where one of these cliques is "hidden" and hence hard to sample. Our upper bound is based on defining a special auxiliary graph H_k, such that sampling edges almost uniformly in H_k translates to sampling k-cliques almost uniformly in the original graph G. We then build on a known edge-sampling algorithm (Eden, Ron and Rosenbaum, ICALP19) to sample edges in H_k, where the challenge is simulate queries to H_k while being given access only to G.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/04/2020

Sampling Arbitrary Subgraphs Exactly Uniformly in Sublinear Time

We present a simple sublinear-time algorithm for sampling an arbitrary s...
research
11/11/2018

Faster sublinear approximations of k-cliques for low arboricity graphs

Given query access to an undirected graph G, we consider the problem of ...
research
11/09/2022

Sampling an Edge in Sublinear Time Exactly and Optimally

Sampling edges from a graph in sublinear time is a fundamental problem a...
research
07/08/2021

Sampling and Counting Edges via Vertex Accesses

We consider the problems of sampling and counting edges from a graph on ...
research
08/18/2020

Amortized Edge Sampling

We present a sublinear time algorithm that allows one to sample multiple...
research
09/23/2020

Sampling an Edge Uniformly in Sublinear Time

The area of sublinear algorithms have recently received a lot of attenti...
research
02/21/2019

The Arboricity Captures the Complexity of Sampling Edges

In this paper, we revisit the problem of sampling edges in an unknown gr...

Please sign up or login with your details

Forgot password? Click here to reset