Debellor: Open Source Modular Platform for Scalable Data Mining

Source: Warsaw University

This paper introduces Debellor (www.debellor.org), an open source, extensible data mining platform with a stream-oriented architecture in which all data transfers between elementary algorithms take the form of a stream of samples. Data streaming enables the implementation of scalable algorithms that can efficiently process volumes of data exceeding available memory. This matters for data mining research and applications, since the most challenging data mining tasks involve voluminous data, either produced by a data source or generated at some intermediate stage of a complex data processing network. The advantages of data streaming are illustrated by experiments with clustering time series.
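The streaming idea described above can be illustrated with a minimal sketch. This is not Debellor's API: the function names `source`, `scale`, and `running_mean` are hypothetical and chosen only for illustration. The sketch assumes a chain of stages that pass samples one at a time, so that no stage ever holds the full data set in memory.

```python
from typing import Iterable, Iterator


def source(n: int) -> Iterator[float]:
    """Hypothetical data source: yields n samples one at a time."""
    for i in range(n):
        yield float(i)


def scale(samples: Iterable[float], factor: float) -> Iterator[float]:
    """Hypothetical intermediate stage: transforms each sample as it passes through."""
    for x in samples:
        yield x * factor


def running_mean(samples: Iterable[float]) -> float:
    """Hypothetical sink: consumes the stream, keeping only a count and a sum."""
    count = 0
    total = 0.0
    for x in samples:
        count += 1
        total += x
    if count == 0:
        raise ValueError("empty stream")
    return total / count


# One million samples flow through the chain, but only one sample
# is held in memory at any moment.
result = running_mean(scale(source(1_000_000), 0.5))
print(result)
```

Because each stage pulls samples lazily from the previous one, memory use stays constant regardless of the stream length, which is the property the abstract attributes to stream-oriented processing.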
Format: PDF
Size: 210.20
Date: May 2009