Overhead Management in Multi-Core Environment

01/31/2022
by   Urmila Shrawankar, et al.
0

In multi-core systems, various factors like inter-process communication, dependency, resource sharing and scheduling, level of parallelism, synchronization, number of available cores etc. influence the extent of possible High Performance Computing parallelization. These parameters if not managed to the root level, later surface as overheads during execution. This paper emphasizes on these parameters of parallelism, their overheads of parallelization and its effective management for optimal parallel execution under any domain. As a whole, we focus on the Dense Linear Algebra (DLA) domain and specifically on Matrix Multiplication and sorting domains. These domains are chosen as they find application in various sectors of scientific and mathematical applications. The comparative analysis of results obtained clarifies the trade-off between serial and parallel execution of DLA problems the surfacing overheads and their possible and effective management.

READ FULL TEXT
research
09/21/2022

POAS: A high-performance scheduling framework for exploiting Accelerator Level Parallelism

Heterogeneous computing is becoming mainstream in all scopes. This new e...
research
11/03/2016

Generating Families of Practical Fast Matrix Multiplication Algorithms

Matrix multiplication (GEMM) is a core operation to numerous scientific ...
research
11/26/2022

Profile-Guided Parallel Task Extraction and Execution for Domain Specific Heterogeneous SoC

In this study, we introduce a methodology for automatically transforming...
research
11/05/2017

HPX Smart Executors

The performance of many parallel applications depends on loop-level para...
research
07/16/2023

Arithmetic Deduction Model for High Performance Computing: A Comparative Exploration of Computational Models Paradigms

A myriad of applications ranging from engineering and scientific simulat...
research
02/25/2017

CHAOS: A Parallelization Scheme for Training Convolutional Neural Networks on Intel Xeon Phi

Deep learning is an important component of big-data analytic tools and i...
research
02/14/2018

A co-located partitions strategy for parallel CFD-DEM couplings

In this work, a new partition-collocation strategy for the parallel exec...

Please sign up or login with your details

Forgot password? Click here to reset