Practical Data Compression for Modern Memory Hierarchies

09/07/2016
by   Gennady Pekhimenko, et al.
0

In this thesis, we describe a new, practical approach to integrating hardware-based data compression within the memory hierarchy, including on-chip caches, main memory, and both on-chip and off-chip interconnects. This new approach is fast, simple, and effective in saving storage space. A key insight in our approach is that access time (including decompression latency) is critical in modern memory hierarchies. By combining inexpensive hardware support with modest OS support, our holistic approach to compression achieves substantial improvements in performance and energy efficiency across the memory hierarchy. Using this new approach, we make several major contributions in this thesis. First, we propose a new compression algorithm, Base-Delta-Immediate Compression (BDI), that achieves high compression ratio with very low compression/decompression latency. BDI exploits the existing low dynamic range of values present in many cache lines to compress them to smaller sizes using Base+Delta encoding. Second, we observe that the compressed size of a cache block can be indicative of its reuse. We use this observation to develop a new cache insertion policy for compressed caches, the Size-based Insertion Policy (SIP), which uses the size of a compressed block as one of the metrics to predict its potential future reuse. Third, we propose a new main memory compression framework, Linearly Compressed Pages (LCP), that significantly reduces the complexity and power cost of supporting main memory compression. We demonstrate that any compression algorithm can be adapted to fit the requirements of LCP, and that LCP can be efficiently integrated with the existing cache compression designs, avoiding extra compression/decompression.

READ FULL TEXT
research
05/20/2017

Cache Hierarchy Optimization

Power consumption, off-chip memory bandwidth, chip area and Network on C...
research
09/01/2022

Hermes: Accelerating Long-Latency Load Requests via Perceptron-Based Off-Chip Load Prediction

Long-latency load requests continue to limit the performance of high-per...
research
01/03/2022

Energy-efficient Non Uniform Last Level Caches for Chip-multiprocessors Based on Compression

With technology scaling, the size of cache systems in chip-multiprocesso...
research
01/19/2019

Surface Compression Using Dynamic Color Palettes

Off-chip memory traffic is a major source of power and energy consumptio...
research
04/07/2022

Forecasting lifetime and performance of a novel NVM last-level cache with compression

Non-volatile memory (NVM) technologies are interesting alternatives for ...
research
06/15/2020

Addressing Variability in Reuse Prediction for Last-Level Caches

Last-Level Cache (LLC) represents the bulk of a modern CPU processor's t...
research
02/10/2020

A Computational Approach to Packet Classification

Multi-field packet classification is a crucial component in modern softw...

Please sign up or login with your details

Forgot password? Click here to reset