New Thread Migration Strategies for NUMA Systems

09/28/2018
by O. G. Lorenzo, et al.

Multicore systems present on-board memory hierarchies and communication networks that influence performance when executing shared memory parallel codes. Characterising this influence is complex, and understanding the effect of particular hardware configurations on different codes is of paramount importance. In previous works, monitoring information extracted from hardware counters at runtime has been used to characterise the behaviour of each thread in the parallel code in terms of the number of floating point operations per second, operational intensity, and latency of memory access. We propose to use this information to guide thread migration strategies that improve execution efficiency by increasing locality and affinity. Different configurations of NAS Parallel OpenMP benchmarks on multicores were used to validate the benefits of the proposed thread migration strategies. Our proposed strategies produce improvements of up to 70%, with a small degradation in performance for codes with high locality and affinity.
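The three per-thread metrics named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `ThreadSample` type, its field names, and the "pick the thread with the highest mean latency" rule are hypothetical, introduced only for illustration. The one definition relied on is the standard one for operational intensity, floating point operations per byte of memory traffic.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class ThreadSample:
    """Hypothetical per-thread counter readings over one sampling interval."""
    thread_id: int
    flops: float         # floating point operations counted in the interval
    bytes_moved: float   # bytes transferred to or from memory in the interval
    seconds: float       # length of the sampling interval
    mean_latency: float  # mean memory access latency, in cycles

    @property
    def gflops(self) -> float:
        # Floating point operations per second, scaled to GFLOPS.
        return self.flops / self.seconds / 1e9

    @property
    def operational_intensity(self) -> float:
        # Floating point operations per byte of memory traffic.
        return self.flops / self.bytes_moved


def worst_latency_thread(samples: Iterable[ThreadSample]) -> ThreadSample:
    """Return the sample with the highest mean latency.

    Assumed selection rule for illustration only; the abstract does not
    specify how migration candidates are chosen.
    """
    return max(samples, key=lambda s: s.mean_latency)


samples = [
    ThreadSample(thread_id=0, flops=2e9, bytes_moved=1e9, seconds=1.0, mean_latency=120.0),
    ThreadSample(thread_id=1, flops=1e9, bytes_moved=4e9, seconds=1.0, mean_latency=310.0),
]
candidate = worst_latency_thread(samples)
print(candidate.thread_id, candidate.gflops, candidate.operational_intensity)
```

In this sketch, thread 1 combines low operational intensity with high latency, which is the kind of thread a locality-driven strategy might consider moving closer to its data. On Linux, the actual move could be made with a call such as `os.sched_setaffinity`, though the abstract does not say which mechanism the authors use.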

