International Journal of Research in Advent Technology (IJRAT)
Distributed Data Mining (DDM) is a process to extract globally interesting associations, classifiers, clusters and other patterns from distributed data. As datasets double in size every year, moving the data repeatedly to distant CPUs brings about high communication cost. In this paper, the authors used data cloud to implement DDM in order to move the data rather than moving the entire computation. MapReduce is a software model for implementing data-centric distributed computing. Initially, a kind of cloud system architecture for DDM is proposed.