Ibdxnet: Leveraging InfiniBand in Highly Concurrent Java Applications

12/05/2018
by   Stefan Nothaas, et al.
0

In this report, we describe the design and implementation of Ibdxnet, a low-latency and high-throughput transport providing the benefits of InfiniBand networks to Java applications. Ibdxnet is part of the Java-based DXNet library, a highly concurrent and simple to use messaging stack with transparent serialization of messaging objects and focus on very small messages (< 64 bytes). Ibdxnet implements the transport interface of DXNet in Java and a custom C++ library in native space using JNI. Several optimizations in both spaces minimize context switching overhead between Java and C++ and are not burdening message latency or throughput. Communication is implemented using the messaging verbs of the ibverbs library complemented by an automatic connection management in the native library. We compared DXNet with the Ibdxnet transport to the MPI implementations FastMPJ and MVAPICH2. For small messages up to 64 bytes using multiple threads, DXNet with the Ibdxnet transport achieves a bi-directional message rate of 10 million messages per second and surpasses FastMPJ by a factor of 4 and MVAPICH by a factor of 2. Furthermore, DXNet scales well on a high load all-to-all communication with up to 8 nodes achieving a total aggregated message rate of 43.4 million messages per second for small messages and a throughput saturation of 33.6 GB/s with only 2 kb message size.

READ FULL TEXT

page 2

page 7

page 8

page 9

page 14

research
05/15/2020

kiwiPy: Robust, high-volume, messaging for big-data and computational science workflows

In this work we present kiwiPy, a Python library designed to support rob...
research
03/28/2021

MT-lib: A Topology-aware Message Transfer Library for Graph500 on Supercomputers

We present MT-lib, an efficient message transfer library for messages ga...
research
02/22/2021

CoinTossX: An open-source low-latency high-throughput matching engine

We deploy and demonstrate the CoinTossX low-latency, high-throughput, op...
research
08/16/2019

Path-Sensitive Atomic Commit: Local Coordination Avoidance for Distributed Transactions (Technical Report)

Concurrent objects with asynchronous messaging are an increasingly popul...
research
02/13/2020

Scheduling periodic messages on a shared link

Cloud-RAN is a recent architecture for mobile networks where the process...
research
09/28/2022

Accelerating netty-based applications through transparent InfiniBand support

Many big-data frameworks are written in Java, e.g. Apache Spark, Flink a...
research
08/18/2015

Cold Object Identification in the Java Virtual Machine

Many Java applications instantiate objects within the Java heap that are...

Please sign up or login with your details

Forgot password? Click here to reset