Provided by: Virginia Systems
Topic: Big Data
Format: PDF
A Virtual Cluster (VC) consists of multiple Virtual Machines (VMs) running on different physical hosts and interconnected by a virtual network. A fault-tolerance protocol and mechanism are essential to the VC's availability and usability. The authors present Virtual Predict Checkpointing (VPC), a lightweight, globally consistent checkpointing mechanism that checkpoints the VC so that it can be restored immediately after a VM failure. By predicting the page faults that checkpointing causes during each checkpointing interval, VPC reduces the downtime of each individual VM further than traditional incremental checkpointing approaches.
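The idea of predicting checkpoint-caused page faults can be illustrated with a toy model. The sketch below is not the authors' implementation. It assumes a copy-on-write style of incremental checkpointing in which the first write to a protected page traps, and it assumes a deliberately simple predictor: pages written in the previous interval are expected to be written again and are copied ahead of time. The class name PredictiveCheckpointer, its fields, and the heuristic itself are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class PredictiveCheckpointer:
    """Toy model of incremental checkpointing with page-fault prediction.

    Assumption: the pages dirtied in the previous interval are predicted to
    be dirtied again. This heuristic is illustrative only and is not taken
    from the VPC paper.
    """

    predicted: set = field(default_factory=set)  # pages pre-copied at checkpoint time
    faults: int = 0   # write-protect faults actually taken
    avoided: int = 0  # faults avoided because the page was predicted

    def run_interval(self, writes):
        """Simulate one checkpointing interval for a sequence of page writes."""
        dirty = set()
        for page in writes:
            if page in dirty:
                continue  # page is already writable in this interval
            if page in self.predicted:
                self.avoided += 1  # pre-copied, so the first write does not trap
            else:
                self.faults += 1   # first write to a protected page traps
            dirty.add(page)
        # Use this interval's dirty set as the prediction for the next one.
        self.predicted = dirty
        return dirty


cp = PredictiveCheckpointer()
cp.run_interval([1, 2, 3, 2])  # nothing predicted yet: pages 1, 2, 3 each fault once
cp.run_interval([2, 3, 4])     # pages 2 and 3 were predicted; only page 4 faults
print(cp.faults, cp.avoided)
```

In this model the second interval takes one fault instead of three, which is the kind of saving that prediction is meant to deliver. A real predictor would also have to weigh the cost of copying pages that turn out not to be written.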