A Learned Cache Eviction Framework with Minimal Overhead

01/27/2023
by   Dongsheng Yang, et al.
0

Recent work shows the effectiveness of Machine Learning (ML) to reduce cache miss ratios by making better eviction decisions than heuristics. However, state-of-the-art ML caches require many predictions to make an eviction decision, making them impractical for high-throughput caching systems. This paper introduces Machine learning At the Tail (MAT), a framework to build efficient ML-based caching systems by integrating an ML module with a traditional cache system based on a heuristic algorithm. MAT treats the heuristic algorithm as a filter to receive high-quality samples to train an ML model and likely candidate objects for evictions. We evaluate MAT on 8 production workloads, spanning storage, in-memory caching, and CDNs. The simulation experiments show MAT reduces the number of costly ML predictions-per-eviction from 63 to 2, while achieving comparable miss ratios to the state-of-the-art ML cache system. We compare a MAT prototype system with an LRU-based caching system in the same setting and show that they achieve similar request rates.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/22/2021

KML: Using Machine Learning to Improve Storage Systems

Operating systems include many heuristic algorithms designed to improve ...
research
05/02/2018

ECI-Cache: A High-Endurance and Cost-Efficient I/O Caching Scheme for Virtualized Platforms

In recent years, high interest in using Virtual Machines (VMs) in data c...
research
06/11/2021

A New Upper Bound on Cache Hit Probability for Non-anticipative Caching Policies

Caching systems have long been crucial for improving the performance of ...
research
09/19/2023

Ditto: An Elastic and Adaptive Memory-Disaggregated Caching System

In-memory caching systems are fundamental building blocks in cloud servi...
research
12/18/2021

Multi-step LRU: SIMD-based Cache Replacement for Lower Overhead and Higher Precision

A key-value cache is a key component of many services to provide low-lat...
research
04/14/2020

Model and Machine Learning based Caching and Routing Algorithms for Cache-enabled Networks

In-network caching is likely to become an integral part of various netwo...
research
04/06/2022

SqueezeNeRF: Further factorized FastNeRF for memory-efficient inference

Neural Radiance Fields (NeRF) has emerged as the state-of-the-art method...

Please sign up or login with your details

Forgot password? Click here to reset