Distributed Triangle Counting in the Graphulo Matrix Math Library

08/20/2017
by   Dylan Hutchison, et al.
Triangle counting is a key algorithm for large graph analysis. The Graphulo library provides a framework for implementing graph algorithms on the Apache Accumulo distributed database. In this work, we adapt two triangle counting algorithms to the Graphulo library for server-side processing inside Accumulo: one that uses only the adjacency matrix, and another that also uses the incidence matrix. Cloud-based experiments show a similar performance profile for the two approaches on the family of power-law Graph500 graphs, for which data skew increasingly becomes the bottleneck. These results motivate the design of skew-aware hybrid algorithms, which we propose for future work.
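To make the two matrix formulations concrete, here is a minimal single-machine sketch in plain Python. It is not Graphulo or Accumulo code, and the function names are hypothetical. It assumes an undirected simple graph (no self-loops, no duplicate edges) with vertices numbered from 0, and it uses two standard identities that may differ in detail from the variants the paper adapts: with A split into strictly lower and upper triangles L and U, the sum of the entries of (L·U) masked by A equals twice the triangle count; and with incidence matrix E, each triangle contributes exactly three entries equal to 2 in A·E.

```python
from itertools import combinations


def adjacency_matrix(n, edges):
    """Dense 0/1 adjacency matrix of an undirected simple graph (assumed input)."""
    A = [[0] * n for _ in range(n)]
    for u, v in edges:
        A[u][v] = A[v][u] = 1
    return A


def count_triangles_adjacency(A):
    """Adjacency-only formulation: sum((L @ U) * A) / 2.

    (L @ U)[i][j] counts common neighbours k with k < min(i, j),
    so each triangle is counted once per direction of its top edge.
    """
    n = len(A)
    total = 0
    for i in range(n):
        for j in range(n):
            if A[i][j]:  # elementwise mask by A
                total += sum(A[i][k] * A[k][j] for k in range(min(i, j)))
    return total // 2


def count_triangles_incidence(A, edges):
    """Adjacency + incidence formulation: count entries of A @ E equal to 2.

    (A @ E)[w][e] = A[w][u] + A[w][v] for edge e = (u, v); a value of 2
    means w closes a triangle with e. Each triangle is found 3 times.
    """
    n = len(A)
    hits = 0
    for u, v in edges:
        for w in range(n):
            if A[w][u] + A[w][v] == 2:
                hits += 1
    return hits // 3


# Illustrative input: the complete graph on 4 vertices.
k4_edges = list(combinations(range(4), 2))
k4 = adjacency_matrix(4, k4_edges)
k4_by_adjacency = count_triangles_adjacency(k4)
k4_by_incidence = count_triangles_incidence(k4, k4_edges)
```

Both functions should agree on any simple undirected graph. The dense loops are only for readability; a distributed implementation would operate on sparse matrix entries, which is where the skew of power-law degree distributions becomes relevant.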
