Improving Job Scheduling in Hadoop MapReduce

Provided by: Creative Commons
Topic: Big Data
Format: PDF
Hadoop is a framework for processing large amount of data in parallel with the help of Hadoop Distributed File System (HDFS) and MapReduce framework. Job scheduling is an important process in Hadoop MapReduce. MapReduce scheduler does not scale well in heterogeneous environment. As an extension of Hadoop scheduler, LATE MapReduce scheduling algorithm takes heterogeneous environment into consideration. However, its performance is much poor due to the static manner in which it computes progress of tasks. So, neither Hadoop nor LATE schedulers are desirable in heterogeneous environment.

Find By Topic