Fault Tolerance Management in IaaS Clouds

Fault tolerance, reliability and availability in Cloud computing are critical to ensure correct and continuous system operation also in the presence of failures. In this paper, the authors present an approach to evaluate fault tolerance mechanisms that use the virtualization technology to transparently increase the reliability and availability of applications deployed in the virtual machines in a Cloud. In contrast to several existing solutions that assume independent failures, they take into account the failure behavior of various server components, network and power distribution in a typical Cloud computing infrastructure, the correlation between individual failures, and the impact of each failure on user's applications.

Provided by: Institute of Electrical & Electronic Engineers Topic: Cloud Date Added: Oct 2012 Format: PDF

Find By Topic