Universal Caching

05/10/2022
by Ativ Joshi, et al.

In the learning literature, the performance of an online policy is commonly measured by static regret, which compares the cumulative loss of the online policy to that of an optimal benchmark in hindsight. In the definition of static regret, the benchmark policy remains fixed throughout the time horizon. As a result, the regret bounds become loose in non-stationary settings, where fixed benchmarks often perform poorly. In this paper, we investigate a stronger notion of regret minimization in the context of an online caching problem. In particular, we allow the action of the offline benchmark at any round to be decided by a finite-state predictor containing arbitrarily many states. Using ideas from the universal prediction literature in information theory, we propose an efficient online caching policy with an adaptive sub-linear regret bound. To the best of our knowledge, this is the first data-dependent regret bound known for the universal caching problem. We establish this result by combining a recently proposed online caching policy with an incremental parsing algorithm, e.g., Lempel-Ziv '78. Our methods also yield a simpler learning-theoretic proof of the improved regret bound, in place of the more involved and problem-specific combinatorial arguments used in earlier works.
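
To make the incremental parsing step concrete, the following is a minimal sketch of Lempel-Ziv '78 parsing applied to a request sequence. It illustrates only the parsing rule (each new phrase is the shortest prefix of the remaining input that has not appeared as an earlier phrase). The function name `lz78_parse` and the toy request string are assumptions made for illustration, and the abstract does not specify how the paper couples the parsed phrases with the caching policy, so that part is not shown.

```python
def lz78_parse(sequence):
    """Split `sequence` into LZ78 phrases.

    Each phrase is the shortest prefix of the remaining input that has
    not already appeared as a phrase. Hypothetical helper for illustration.
    """
    seen = set()      # phrases emitted so far
    phrases = []      # parsed phrases, in order
    current = ()      # phrase currently being extended
    for symbol in sequence:
        current = current + (symbol,)
        if current not in seen:
            # First time this string is seen: close the phrase here.
            seen.add(current)
            phrases.append(current)
            current = ()
    if current:
        # Input ended inside a phrase that was already seen.
        phrases.append(current)
    return phrases


# Toy request sequence over two files "a" and "b" (assumed data).
requests = "abababab"
print(["".join(p) for p in lz78_parse(requests)])
```

In the universal prediction literature, the phrases produced this way are typically arranged in a tree whose nodes serve as contexts (states) for the predictor; the number of such nodes grows with the complexity of the observed sequence rather than being fixed in advance.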

research
04/22/2019

Learning to Cache With No Regrets

This paper introduces a novel caching analysis that, contrary to prior w...
research
03/04/2015

Hierarchies of Relaxations for Online Prediction Problems with Evolving Constraints

We study online prediction where regret of the algorithm is measured aga...
research
03/31/2020

Fundamental Limits of Online Network-Caching

Optimal caching of files in a content distribution network (CDN) is a pr...
research
04/20/2022

Online Caching with no Regret: Optimistic Learning via Recommendations

The design of effective online caching policies is an increasingly impor...
research
06/30/2020

Dynamic Regret of Policy Optimization in Non-stationary Environments

We consider reinforcement learning (RL) in episodic MDPs with adversaria...
research
09/17/2020

Caching in Networks without Regret

We consider the online problem where n users are connected to m caches ...
research
04/03/2020

Universal Algorithms: Beyond the Simplex

The bulk of universal algorithms in the online convex optimisation liter...