VPC: Scalable, Low Downtime Checkpointing for Virtual Clusters

Provided by: Virginia Systems
Topic: Big Data
Format: PDF
A Virtual Cluster (VC) consists of multiple Virtual Machines (VMs) running on different physical hosts, interconnected by a virtual network. A fault-tolerant protocol and mechanism are essential to the VC's availability and usability. The authors present Virtual Predict Checkpointing (VPC), a lightweight, globally consistent checkpointing mechanism that checkpoints the VC so that it can be restored immediately after VM failures. By predicting the checkpoint-caused page faults during each checkpointing interval, VPC reduces solo VM downtime further than traditional incremental checkpointing approaches.
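
The idea of predicting checkpoint-caused page faults can be illustrated with a toy simulation. This is only a sketch under stated assumptions, not the authors' implementation: the abstract does not describe the predictor, so the code assumes a simple "pages dirtied in the previous interval will be dirtied again" heuristic, and the function name `simulate` and its return values are hypothetical. It compares plain incremental checkpointing, where the first write to each write-protected page faults, against a variant that leaves predicted pages writable and copies them unconditionally.

```python
from typing import Iterable, List, Set, Tuple


def simulate(intervals: List[Iterable[int]]) -> Tuple[int, int, int]:
    """Count write faults over a sequence of checkpoint intervals.

    Each interval is the collection of page numbers written in it.
    Returns (baseline_faults, predicted_faults, wasted_copies).
    """
    baseline_faults = 0   # plain incremental checkpointing
    predicted_faults = 0  # with the assumed last-interval predictor
    wasted_copies = 0     # predicted pages that were not actually written
    previous: Set[int] = set()

    for writes in intervals:
        dirtied = set(writes)

        # Baseline: all pages are write-protected at the start of the
        # interval, so the first write to each distinct page faults.
        baseline_faults += len(dirtied)

        # Assumed predictor: pages dirtied last interval stay writable,
        # so only writes to unpredicted pages fault.
        predicted_faults += len(dirtied - previous)

        # Mispredicted pages are still copied at the next checkpoint.
        wasted_copies += len(previous - dirtied)

        previous = dirtied

    return baseline_faults, predicted_faults, wasted_copies


if __name__ == "__main__":
    trace = [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
    print(simulate(trace))
```

In this toy trace the predictor trades a few unnecessary page copies for fewer write faults; whether that trade shortens downtime in practice depends on the workload's write locality and on the real predictor, neither of which the abstract specifies.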
