Date Added: Aug 2012
Although MapReduce, the core technology of cloud computing, lowers the barriers to enter the parallel computing, it introduces the other challenging research issue of improving its performance via properly resource provisioning. This issue is more complex in a heterogeneous cloud with multiple jobs since the nodes have various capability and workloads. In addition, the limited resources must be shared among all jobs. In this paper, this optimization problem, called Node Capability-aware Provisioning Problem (NCPP), is first formulated as a mathematical model. The purpose of NCPP is to minimize the job execution time which is influenced by node capability. However, NCPP is subject to the resource constraints on the nodes in a cloud.