The fast growth of computational power and scales of modern super-comput...
GPU-aware collective communication has become a major bottleneck for mod...
In the exascale computing era, optimizing MPI collective performance in
...
General matrix/matrix multiplication (GEMM) is crucial for scientific
co...
General Matrix Multiplication (GEMM) is a crucial algorithm for various
...
With the ever-increasing computing power of supercomputers and the growi...
One-sided dense matrix decompositions (e.g., Cholesky, LU, and QR) are t...
Transformer is the cornerstone model of Natural Language Processing (NLP...
Fine-grained sketch-based image retrieval (FG-SBIR) addresses the proble...
This paper present a strong data mining method based on rough set, which...
Today's scientific simulations require a significant reduction of data v...
Homomorphic Encryption (HE) is an emerging encryption scheme that allows...
Error-bounded lossy compression is becoming an indispensable technique f...
Basic Linear Algebra Subprograms (BLAS) is a core library in scientific
...
Soft error, namely silent corruption of signal or datum in a computer sy...
Efficient error-controlled lossy compressors are becoming critical to th...
Lossy compression is one of the most important strategies to resolve the...
This paper presents a novel accelerated exact k-means algorithm called t...
Convolutional neural networks (CNNs) are becoming more and more importan...
Neural Network based models have been state-of-the-art models for variou...
With ever-increasing volumes of scientific data produced by HPC applicat...
Error-controlled lossy compression has been studied for years because of...
Iterative methods are commonly used approaches to solve large, sparse li...
Variations in High Performance Computing (HPC) system software configura...
In situ lossy compression allowing user-controlled data loss can
signifi...
Because of vast volume of data being produced by today's scientific
simu...