Text Document Clustering in Distributed Peer Networks
Text mining identifies new, previously unknown information by applying techniques from natural language processing and data mining. Most text mining techniques, however, rely on statistical analysis of a word or phrase, typically its frequency. If two terms have the same frequency in a set of documents, frequency alone cannot show which term contributes more to the meaning of the text, so the more meaningful term must be selected by other means. Several techniques have been presented for analyzing terms at the document or sentence level. A concept-based model is one such approach: it can effectively distinguish unimportant terms from important ones with respect to sentence semantics. Documents are then clustered according to the concepts identified from the semantics of their sentences.
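The frequency limitation described above can be illustrated with a minimal sketch. The toy document, the regex tokenizer, and the function name `term_frequencies` are assumptions made for illustration only; they are not part of the model the paper describes.

```python
import re
from collections import Counter


def term_frequencies(text):
    """Count lowercase alphabetic tokens in a text (illustrative tokenizer)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


# Hypothetical toy document: "cluster" carries the topic, "note" does not.
doc = (
    "The cluster algorithm groups documents. "
    "Note the cluster output. Note the result."
)

freqs = term_frequencies(doc)

# Both terms occur equally often, so a purely frequency-based measure
# cannot tell which one matters more to the meaning of the sentences.
same_frequency = freqs["cluster"] == freqs["note"]
print(freqs["cluster"], freqs["note"], same_frequency)
```

Here the two terms are tied on frequency even though only one of them reflects the topic of the text, which is the kind of case a sentence-level semantic analysis is meant to resolve.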