An Evaluation of the Cost and Performance of Scientific Workflows on Amazon EC2

Workflows are used to orchestrate data-intensive applications in many different scientific domains. Workflow applications typically communicate data between processing steps using intermediate files. When tasks are distributed, these files are either transferred from one computational node to another, or accessed through a shared storage system. As a result, the efficient management of data is a key factor in achieving good performance for workflow applications in distributed environments. In this paper, the authors investigate some of the ways in which data can be managed for workflows in the cloud.

Provided by: University of Northern Iowa
Topic: Cloud
Date Added: Feb 2012
Format: PDF
