International Journal of Advanced Research in Computer Science and Software Engineering (IJARCSSE)
In the today's era of technology storing and processing a data is a very important aspect. Now-a-days, even tera bytes and penta bytes of data is not sufficient for the storage of large chunk of database. For that reason companies today use the concept of Hadoop in their application. Also large amount of data warehouses are unable to satisfy the needs of data storage. Hadoop is designed to store large quantity of data sets reliably. In this paper, the authors describe big data and Apache Hadoop the most widely used software framework which supports data intensive distributed application.