International Journal of Engineering and Innovative Technology (IJEIT)
The people are living in on-demand digital universe with data spread by users and organizations at a very high rate. This data is categorized as big data because of its variety, velocity, veracity and volume. This data is again classified into unstructured, semi-structured and structured. Large datasets require special processing systems; it is a unique challenge for academicians and researchers. Map Reduce jobs use efficient data processing techniques which are applied in every phases of Map Reduce such as mapping, combining, shuffling, indexing, grouping and reducing. Big data has essential characteristics as follows variety, volume, velocity, viscosity and virality.