The term big data is extensively used in many computational and decision making domains. Big data is nothing but the large data sets formed from various sources and are almost impossible to process and analyze using traditional approaches because of its complexity. Efficient analysis and processing of big data within a given time frame is essential for it to be useful. Various technologies like Hadoop, MapReduce, etc. are used to analyze the big data and hence possible to retrieve knowledge from the large datasets.