Simplifying MapReduce Data Processing
MapReduce is a programming model developed by Google for processing and generating large data sets in distributed environments. Many real-world tasks can be implemented by two functions, map and reduce. MapReduce plays a key role in Cloud Computing, since it decreases the complexity of the distributed programming and is easy to be developed on large clusters of common machines. Hadoop, an open-source project, is used to implement Google MapReduce architecture. It is wildly used by many applications such as FaceBook, Yahoo, Twitter, and so on. However, it is difficult to decouple an application into functions of map and reduce for common users.