International Journal of Computer Science Issues
Data cleaning is a process of correcting or removing of erroneous data caused by contradictions, disparities, keying mistakes, missing bits, etc to create consistent and reliable information. Text files are used to store simple information and which can be also deceptive in terms of dirty data. In this paper the authors have provided a solution to cleanup simple text file using some data cleaning processes. Though they use text files so often but there is no such robust method exist to clean up text files.