Practical Bounds on Optimal Caching with Variable Object Sizes

11/10/2017
by Daniel S. Berger, et al.

Many recent caching systems aim to improve hit ratios, but there is no good sense among practitioners of how much further hit ratios can be improved. In other words, should the systems community continue working on this problem? Currently, there is no principled answer to this question. Most prior work assumes that objects have the same size, but in practice object sizes often vary by several orders of magnitude. The few known results for variable object sizes provide very weak guarantees and are impractical to compute on traces of realistic length. We propose a new method to compute the offline optimal hit ratio under variable object sizes. Our key insight is to represent caching as a min-cost flow problem; hence we call our method the flow-based offline optimal (FOO). We show that, under simple independence assumptions and Zipf popularities, FOO's bounds become tight as the number of objects goes to infinity. From FOO we develop fast, practical methods to compute nearly tight bounds for the optimal hit ratio, which we call practical flow-based offline optimal (P-FOO). P-FOO enables the first analysis of optimal caching on realistic traces with hundreds of millions of requests. We evaluate P-FOO on several production traces, and the results show that recent caching systems are still far from optimal.
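
To make the offline problem concrete, the following minimal Python sketch computes the optimal number of hits for a toy trace by exhaustive search. It is not FOO or P-FOO: it only illustrates the quantity those methods bound. The function name `optimal_hits`, the interval model (an object is either kept in cache for the whole span between two consecutive requests to it, or not at all, and the cache may bypass objects), and the example trace and sizes are assumptions made for illustration and do not come from the abstract.

```python
from itertools import product


def optimal_hits(trace, sizes, capacity):
    """Exhaustively find the maximum number of hits for a tiny trace.

    Assumed model (illustrative only): for every pair of consecutive
    requests to the same object, we choose whether to keep the object
    cached for the entire interval between them. Keeping it turns the
    second request into a hit and occupies `sizes[obj]` units of space
    during that interval.
    """
    # Build one interval per pair of consecutive requests to the same object.
    last_seen = {}
    intervals = []  # (start_index, end_index, object_size)
    for t, obj in enumerate(trace):
        if obj in last_seen:
            intervals.append((last_seen[obj], t, sizes[obj]))
        last_seen[obj] = t

    best = 0
    # Try every subset of intervals: exponential, so toy traces only.
    for keep in product((0, 1), repeat=len(intervals)):
        # usage[t] = space held between request t and request t + 1
        usage = [0] * (len(trace) - 1)
        for kept, (start, end, size) in zip(keep, intervals):
            if kept:
                for t in range(start, end):
                    usage[t] += size
        if all(u <= capacity for u in usage):
            best = max(best, sum(keep))
    return best


# Hypothetical toy trace: object "a" is as large as the whole cache.
trace = list("abcbca")
sizes = {"a": 2, "b": 1, "c": 1}
hits = optimal_hits(trace, sizes, capacity=2)
print(f"optimal hits: {hits} of {len(trace)} requests")
```

In this toy trace, keeping the large object `a` fills the cache and yields one hit, while bypassing it leaves room for both small objects and yields two. Exhaustive search grows exponentially with the number of intervals, which is why a scalable formulation is needed for traces of realistic length.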


