Policy/mechanism separation in the Warehouse-Scale OS

03/17/2023
by   mark-mansi, et al.
0

"As many of us know from bitter experience, the policies provided in extant operating systems, which are claimed to work well and behave fairly 'on the average', often fail to do so in the special cases important to us" [Wulf et al. 1974]. Written in 1974, these words motivated moving policy decisions into user-space. Today, as warehouse-scale computers (WSCs) have become ubiquitous, it is time to move policy decisions away from individual servers altogether. Built-in policies are complex and often exhibit bad performance at scale. Meanwhile, the highly-controlled WSC setting presents opportunities to improve performance and predictability. We propose moving all policy decisions from the OS kernel to the cluster manager (CM), in a new paradigm we call Grape CM. In this design, the role of the kernel is reduced to monitoring, sending metrics to the CM, and executing policy decisions made by the CM. The CM uses metrics from all kernels across the WSC to make informed policy choices, sending commands back to each kernel in the cluster. We claim that Grape CM will improve performance, transparency, and simplicity. Our initial experiments show how the CM can identify the optimal set of huge pages for any workload or improve memcached latency by 15

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/26/2015

EOS: Automatic In-vivo Evolution of Kernel Policies for Better Performance

Today's monolithic kernels often implement a small, fixed set of policie...
research
09/22/2021

Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment

Algorithmic recommendations and decisions have become ubiquitous in toda...
research
04/09/2020

Efficient Kernel Object Management for Tiered Memory Systems with KLOC

Software-controlled heterogeneous memory systems have the potential to i...
research
08/31/2020

Ranking Policy Decisions

Policies trained via Reinforcement Learning (RL) are often needlessly co...
research
09/12/2017

Information Design in Crowdfunding under Thresholding Policies

In crowdfunding, an entrepreneur often has to decide how to disclose the...
research
05/04/2020

Dim Silicon and the Case for Improved DVFS Policies

Due to thermal and power supply limits, modern Intel CPUs reduce their f...
research
03/26/2021

Composable Learning with Sparse Kernel Representations

We present a reinforcement learning algorithm for learning sparse non-pa...

Please sign up or login with your details

Forgot password? Click here to reset