NV-Process: A Fault-Tolerance Process Model Based on Non-Volatile Memory

Reliability wall is one of the most challenging problems for next generation High Performance Computing (HPC) systems. Traditional system design adopts extra fault tolerance mechanism. However, the cost of fault tolerance mechanism itself may incur huge cost, so as to decrease the utilization ratio of the HPC system. To address this problem, the authors present NV-process, a fault-tolerance process model based on NVRAM. NV-process instances run in a self-contained way in NVRAM, thus to survive across operating system reboot. NV-process provides an elegant way for the applications to tolerate system crashes.

Provided by: Association for Computing Machinery Topic: Storage Date Added: Jul 2012 Format: PDF

Find By Topic