International Journal of Computer Science and Mobile Computing (IJCSMC)
Automatic increment in data is a difficult problem, as it requires the development of well-defined algorithms and a runtime system to support performance code. Many online data sets grow incrementally over time as new entries are slowly added and existing entries are deleted or updated to manage the dataset. The Hadoop is a dedicated distributed paradigm used to manipulate the large amount of distributed data. This manipulation contains not only storage but also the computation and processing of the data. Hadoop is normally used for data centric applications.