Fast Modeling L2 Cache Reuse Distance Histograms Using Combined Locality Information from Software Traces

07/11/2019
by   Ming Ling, et al.
0

To mitigate the performance gap between the CPU and the main memory, multi-level cache architectures are widely used in modern processors. Therefore, modeling the behaviors of the downstream caches becomes a critical part of the processor performance evaluation in the early stage of Design Space Exploration (DSE). In this paper, we propose a fast and accurate L2 cache reuse distance histogram model, which can be used to predict the behaviors of the multi-level cache architectures where the L1 cache uses the LRU replacement policy and the L2 cache uses LRU/Random replacement policies. We use the L1 reuse distance histogram and two newly proposed metrics, namely the RST table and the Hit-RDH, that describing more detailed information of the software traces as the inputs. The output of our model is the L2 cache reuse distance histogram, based on which the L2 cache miss rates can be evaluated. We compare the L2 cache miss rates with the results from gem5 cycle-accurate simulations of 15 benchmarks chosen from SPEC2006. The average absolute error is less than 5 model into a multi-core architecture in which two cores share a unified L2 cache. The error of our model, in this case, is less than 7

READ FULL TEXT

page 7

page 8

page 10

research
09/10/2021

An Effective Early Multi-core System Shared Cache Design Method Based on Reuse-distance Analysis

In this paper, we proposed an effective and efficient multi-core shared-...
research
10/06/2016

Validating Simplified Processor Models in Architectural Studies

Cycle-accurate software simulation of multicores with complex microarchi...
research
09/10/2021

A Fast-and-Effective Early-Stage Multi-level Cache Optimization Method Based on Reuse-Distance Analysis

In this paper, we propose a practical and effective approach allowing de...
research
03/22/2010

Proficient Pair of Replacement Algorithms on L1 and L2 Cache for Merge Sort

Memory hierarchy is used to compete the processors speed. Cache memory i...
research
04/11/2021

PPT-Multicore: Performance Prediction of OpenMP applications using Reuse Profiles and Analytical Modeling

We present PPT-Multicore, an analytical model embedded in the Performanc...
research
01/06/2020

A Fast Analytical Model of Fully Associative Caches

While the cost of computation is an easy to understand local property, t...
research
03/19/2021

PPT-SASMM: Scalable Analytical Shared Memory Model: Predicting the Performance of Multicore Caches from a Single-Threaded Execution Trace

Performance modeling of parallel applications on multicore processors re...

Please sign up or login with your details

Forgot password? Click here to reset