International Journal of Computer Applications
Many applications deal with huge amount of data and that scattered data needs to be transformed into something relevant and meaningful. To make sense of such data is the need of many applications and areas of technology. The data that is already present is very huge, noisy and has a complex structure. The authors are working on the idea of integrating data mining with data deduplication. Data Dashboard is a tool which can take complex data involving various dimensions and simultaneously uses data deduplication algorithms that help in removing redundancy in the data up to 95%.