International Journal of Computer and Electrical Engineering (IJCEE)
This paper proposes an architecture with a load balancing model and a fault-tolerant model for virtual shared memory clusters. The centralized dynamic load balancing model uses the manager-worker concept with a virtualized compute server. The virtual server can be expanded "on the fly" for load balancing by temporarily adding virtual machines. The fault-tolerant model, controlled by a virtual machine monitor, specifies checkpointing-based recovery methods. Performance evaluation results show that the proposed system achieves a significant speedup in both execution time and checkpoint time.