Moving the California distributed CMS xcache from bare metal into containers using Kubernetes

03/04/2020
by   Edgar Fajardo, et al.
0

The University of California system has excellent networking between all of its campuses as well as a number of other Universities in CA, including Caltech, most of them being connected at 100 Gbps. UCSD and Caltech have thus joined their disk systems into a single logical xcache system, with worker nodes from both sites accessing data from disks at either site. This setup has been in place for a couple years now and has shown to work very well. Coherently managing nodes at multiple physical locations has however not been trivial, and we have been looking for ways to improve operations. With the Pacific Research Platform (PRP) now providing a Kubernetes resource pool spanning resources in the science DMZs of all the UC campuses, we have recently migrated the xcache services from being hosted bare-metal into containers. This paper presents our experience in both migrating to and operating in the new environment.

READ FULL TEXT
research
05/12/2020

Demonstrating 100 Gbps in and out of the public Clouds

There is increased awareness and recognition that public Cloud providers...
research
04/17/2023

A Decentralized Authorization and Security Framework for Distributed Research Workflows

Research challenges such as climate change and the search for habitable ...
research
11/24/2018

MiniOS: an instructional platform for teaching operating systems labs

Delivering hands-on practice laboratories for introductory courses on op...
research
08/29/2018

Fair Marketplace for Secure Outsourced Computations

The cloud computing paradigm offers clients ubiquitous and on demand acc...
research
04/18/2020

Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scientific Computing

Scientific computing needs are growing dramatically with time and are ex...
research
03/15/2022

Cost-effective BlackWater Raft on Highly Unreliable Nodes at Scale Out

The Raft algorithm maintains strong consistency across data replicas in ...
research
01/03/2014

A Framework for Creating a Distributed Rendering Environment on the Compute Clusters

This paper discusses the deployment of existing render farm manager in a...

Please sign up or login with your details

Forgot password? Click here to reset