Fast Correlation Function Calculator – A high-performance pair counting toolkit

01/29/2023
by   Cheng Zhao, et al.
0

Context. A novel high-performance exact pair counting toolkit called Fast Correlation Function Calculator (FCFC) is presented, which is publicly available at https://github.com/cheng-zhao/FCFC. Aims. As the rapid growth of modern cosmological datasets, the evaluation of correlation functions with observational and simulation catalogues has become a challenge. High-efficiency pair counting codes are thus in great demand. Methods. We introduce different data structures and algorithms that can be used for pair counting problems, and perform comprehensive benchmarks to identify the most efficient ones for real-world cosmological applications. We then describe the three levels of parallelisms used by FCFC – including SIMD, OpenMP, and MPI – and run extensive tests to investigate the scalabilities. Finally, we compare the efficiency of FCFC against alternative pair counting codes. Results. The data structures and histogram update algorithms implemented in FCFC are shown to outperform alternative methods. FCFC does not benefit much from SIMD as the bottleneck of our histogram update algorithm is mostly cache latency. Nevertheless, the efficiency of FCFC scales well with the numbers of OpenMP threads and MPI processes, albeit the speedups may be degraded with over a few thousand threads in total. FCFC is found to be faster than most (if not all) other public pair counting codes for modern cosmological pair counting applications.

READ FULL TEXT
research
11/15/2019

Corrfunc: Blazing fast correlation functions with AVX512F SIMD Intrinsics

Correlation functions are widely used in extra-galactic astrophysics to ...
research
04/20/2022

Scalable Motif Counting for Large-scale Temporal Graphs

One fundamental problem in temporal graph analysis is to count the occur...
research
08/13/2019

Pair correlation for Dedekind zeta functions of abelian extensions

Here we study problems related to the proportions of zeros, especially s...
research
07/05/2019

C^3 Framework: An Open-source PyTorch Code for Crowd Counting

This technical report attempts to provide efficient and solid kits addre...
research
08/31/2023

UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting

Since its invention HyperLogLog has become the standard algorithm for ap...
research
03/10/2021

Effectively Counting s-t Simple Paths in Directed Graphs

An important tool in analyzing complex social and information networks i...
research
03/08/2020

Multiset Synchronization with Counting Cuckoo Filters

Set synchronization is a fundamental task in distributed applications an...

Please sign up or login with your details

Forgot password? Click here to reset