CLUMARE: A GUI Based Tool for Clustering Multidimensional Datasets Using Map Reduce
Data clustering is a challenging problem due to the complex and heterogeneous natures of multidimensional data. On the other hand very few clustering methods can successfully deal with the multi-dimensional datasets and it becomes even hard to handle such large amounts of data. For datasets that don't possible to store even on a single disk, parallelism is an excellent option. MapReduce is a programming framework to process large scale data in a massively parallel way.