Date Added: Sep 2012
Text documents are terribly important within the modern organizations; furthermore their constant accumulation enlarges the scope of document storage. Customary text mining and knowledge retrieval techniques of text document sometimes think about word matching. An alternate method of knowledge retrieval is agglomeration. During which document pre-processing is a vital and important step within the agglomeration method and it's a large impact on the success of an information mining project. The agglomeration method involves reading the text documents from the disk, and preprocesses them to create vector house model.