As the usage of information technology has increased in the world, the data generation from various resources has unexpectedly increased. The technology for handling the vast amount of data has not developed as compared to the data generation. Traditional database systems are unable to handle the increased volume of data due to its volume, variety, complexity and variability. To deal with this problem, Hadoop Distributed File System (HDFS) like technology is developed. The data to be processed exists in different format that is why the traditional relational database management system is suitable for the big data.