Demand-driven provisioning of Kubernetes-like resources in OSG

08/22/2023
by Igor Sfiligoi, et al.

The OSG-operated Open Science Pool (OSPool) is an HTCondor-based virtual cluster that aggregates resources from compute clusters provided by several organizations. Most of the resources are not owned by OSG, so demand-based dynamic provisioning is important for maximizing usage without incurring excessive waste. OSG has long relied on GlideinWMS for most of its resource provisioning needs, but GlideinWMS is limited to resources that provide a Grid-compliant Compute Entrypoint. To work around this limitation, the OSG Software Team has developed a glidein container that resource providers can use to contribute directly to the OSPool. The problem with that approach is that it is not demand-driven, which relegates it to backfill scenarios only. To address this limitation, a demand-driven direct provisioner of Kubernetes resources has been developed and successfully used on the NRP. The setup still relies on the OSG-maintained backfill container image but automates the provisioning matchmaking and the subsequent requests. That provisioner has also been extended to support Lancium, a green computing cloud provider with a Kubernetes-like proprietary interface. The provisioner logic has been intentionally kept very simple, making this extension a low-cost project. Both NRP and Lancium resources have been provisioned exclusively using this mechanism for many months.
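
The demand-driven idea described above can be illustrated with a minimal sketch. This is not the authors' provisioner: the function name, the one-glidein-per-idle-job assumption, and both caps (`max_pending`, `max_total`) are hypothetical and chosen only for illustration. The sketch shows one way a provisioning cycle might decide how many additional glidein containers to request, given the number of idle jobs in the pool and the glideins already pending or running.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProvisionerLimits:
    """Hypothetical operator-set caps; not taken from the paper."""
    max_pending: int  # glideins requested but not yet running
    max_total: int    # all glideins, pending plus running


def glideins_to_request(idle_jobs: int, pending: int, running: int,
                        limits: ProvisionerLimits) -> int:
    """Return how many additional glideins to request in this cycle.

    Assumes, for simplicity, that each idle job needs one glidein.
    """
    # Demand not already covered by glideins that are waiting to start.
    unmet = max(0, idle_jobs - pending)
    # Room left under each cap.
    pending_room = max(0, limits.max_pending - pending)
    total_room = max(0, limits.max_total - pending - running)
    return min(unmet, pending_room, total_room)


limits = ProvisionerLimits(max_pending=10, max_total=50)
# No idle jobs means no new requests, which is what distinguishes
# demand-driven provisioning from pure backfill.
print(glideins_to_request(idle_jobs=0, pending=0, running=5, limits=limits))
```

With no idle jobs the function returns zero, so nothing is requested until demand appears. In a real deployment the inputs would come from queries to the HTCondor pool and the resource provider, which are outside the scope of this sketch.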

