Date Added: Feb 2013
Many approaches have been proposed for fault-tolerant execution of workflows in heterogeneous parallel and distributed computing environments such as the cloud. Most earlier work does not rely on failure prediction for the resources, which is hard to achieve from traces of historical failure data collected over years in the target environment. In this paper, to address the problems of software fault prediction, resource unavailability, and monitoring, the authors propose a failure prediction model that involves two different methods. To predict failures at the nodes, they propose a method that uses the Intelligent Platform Management Interface (IPMI) to monitor the nodes and provide data useful for determining likely imminent failures.