Understanding the Interactions of Workloads and DRAM Types: A Comprehensive Experimental Study

02/20/2019
by   Saugata Ghose, et al.
0

It has become increasingly difficult to understand the complex interaction between modern applications and main memory, composed of DRAM chips. Manufacturers are now selling and proposing many different types of DRAM, with each DRAM type catering to different needs (e.g., high throughput, low power, high memory density). At the same time, the memory access patterns of prevalent and emerging workloads are rapidly diverging, as these applications manipulate larger data sets in very different ways. As a result, the combined DRAM-workload behavior is often difficult to intuitively determine today, which can hinder memory optimizations in both hardware and software. In this work, we identify important families of workloads, as well as prevalent types of DRAM chips, and rigorously analyze the combined DRAM-workload behavior. To this end, we perform a comprehensive experimental study of the interaction between nine different DRAM types and 115 modern applications and multiprogrammed workloads. We draw 12 key observations from our characterization, enabled in part by our development of new metrics that take into account contention between memory requests due to hardware design. Notably, we find that (1) newer DRAM technologies such as DDR4 and HMC often do not outperform older technologies such as DDR3, due to higher access latencies and, in the case of HMC, poor exploitation of locality; (2) there is no single memory type that can cater to all of the SoC accelerators (e.g., GDDR5 significantly outperforms other memories for multimedia, while HMC significantly outperforms other memories for networking); and (3) there is still a strong need to lower DRAM latency, but unfortunately the current design trend of commodity DRAM is toward higher latencies to obtain other benefits. We hope that the trends we identify can drive optimizations in both hardware and software design.

READ FULL TEXT

page 6

page 7

page 8

research
05/26/2020

CLR-DRAM: A Low-Cost DRAM Architecture Enabling Dynamic Capacity-Latency Trade-Off

DRAM is the prevalent main memory technology, but its long access latenc...
research
05/22/2018

Storage and Memory Characterization of Data Intensive Workloads for Bare Metal Cloud

As the cost-per-byte of storage systems dramatically decreases, SSDs are...
research
12/17/2022

Accurate Open-set Recognition for Memory Workload

How can we accurately identify new memory workloads while classifying kn...
research
11/23/2019

Arsenal of Hardware Prefetchers

Hardware prefetching is one of the latency tolerance optimization techni...
research
07/31/2018

Emulating Hybrid Memory on NUMA Hardware

Non-volatile memory (NVM) has the potential to disrupt the boundary betw...
research
05/08/2018

Voltron: Understanding and Exploiting the Voltage-Latency-Reliability Trade-Offs in Modern DRAM Chips to Improve Energy Efficiency

This paper summarizes our work on experimental characterization and anal...
research
03/20/2018

Integrating DRAM Power-Down Modes in gem5 and Quantifying their Impact

Across applications, DRAM is a significant contributor to the overall sy...

Please sign up or login with your details

Forgot password? Click here to reset