Building Resilient Cloud Over Unreliable Commodity Infrastructure

by Piyus Kedia, et al.

Cloud computing has emerged as a successful paradigm for efficiently utilizing managed compute infrastructure, such as high-speed rack-mounted servers connected by high-speed networking and backed by reliable storage. Such infrastructure is usually dedicated, physically secured, and supplied with reliable power and networking. However, much idle compute capacity sits in unmanaged infrastructure such as idle desktops, lab machines, physically distant server machines, and laptops. We present a scheme that uses this idle capacity on a best-effort basis and provides high availability even in the face of failures of individual components or facilities. We run virtual machines on the commodity infrastructure and present a cloud interface to our end users. The primary challenge is to maintain availability in the presence of node failures, network failures, and power failures. To achieve availability, we run multiple copies of a virtual machine (VM) redundantly on geographically dispersed physical machines. If one of the running copies of a VM fails, we seamlessly switch over to another running copy. We use VM record/replay to implement this redundancy and switchover. So far, we have implemented VM record/replay for uniprocessor machines on Linux/KVM, and we are working on VM record/replay for shared-memory multiprocessor machines. We report initial experimental results based on our implementation.
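
The redundancy and switchover idea can be illustrated with a toy model. The sketch below is not the authors' KVM implementation; it assumes a VM can be treated as a deterministic state machine whose only nondeterminism is a stream of external events, so that a backup copy replaying the primary's event log reaches the same state and can take over. The names ToyVM, Recorder, and Replayer are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ToyVM:
    """Deterministic state machine standing in for a VM (hypothetical)."""
    state: int = 0
    steps: int = 0  # number of events applied so far

    def apply(self, event: int) -> None:
        # Same events in the same order always yield the same state.
        self.state = (self.state * 31 + event) % 1_000_003
        self.steps += 1


@dataclass
class Recorder:
    """Primary copy: logs every external event, then executes it."""
    vm: ToyVM = field(default_factory=ToyVM)
    log: List[int] = field(default_factory=list)

    def handle(self, event: int) -> None:
        self.log.append(event)
        self.vm.apply(event)


@dataclass
class Replayer:
    """Backup copy: re-executes logged events it has not applied yet."""
    vm: ToyVM = field(default_factory=ToyVM)

    def catch_up(self, log: List[int]) -> None:
        # The slice is a snapshot, so updating vm.steps in the loop is safe.
        for event in log[self.vm.steps:]:
            self.vm.apply(event)

    def promote(self, log: List[int]) -> Recorder:
        # Switchover: replay the rest of the log, then start recording.
        self.catch_up(log)
        return Recorder(vm=self.vm, log=list(log))


# Usage: the backup lags behind, the primary "fails", the backup takes over.
primary = Recorder()
backup = Replayer()
for ev in (3, 1, 4):
    primary.handle(ev)
backup.catch_up(primary.log[:2])      # backup has seen only two events
surviving_log = list(primary.log)     # assume the log outlives the primary
new_primary = backup.promote(surviving_log)
```

A real system would also have to record the timing of interrupts and other nondeterministic inputs and ship the log across the network; this sketch leaves all of that out.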


Efficient image deployment in cloud environments

The biggest overhead for the instantiation of a virtual machine in a clo...

Cloud Scheduler: a resource manager for distributed compute clouds

The availability of Infrastructure-as-a-Service (IaaS) computing clouds ...

Securing Automotive Architectures with Named Data Networking

As in-vehicle communication becomes more complex, the automotive communi...

Failure Aware Semi-Centralized Virtual Network Embedding in Cloud Computing Fat-Tree Data Center Networks

In Cloud Computing, the tenants opting for the Infrastructure as a Servi...

Clustering-based Anomaly Detection for microservices

Anomaly detection is an important step in the management and monitoring ...

V-BOINC: The Virtualization of BOINC

The Berkeley Open Infrastructure for Network Computing (BOINC) is an ope...

Memory Leak Detection Algorithms in the Cloud-based Infrastructure

A memory leak in an application deployed on the cloud can affect the ava...
