Role of Concept Based Mining Model in Enhancing Web Document Clustering
Existing text mining techniques captures the importance of a term (word or phrase) using term frequency at document level only. The text mining techniques should capture the meaning or semantics of a text also. In this paper, a new concept-based mining system is introduced which analyzes the terms on sentence, document, and corpus levels. This model is capable of differentiating between important and non important terms with respect to the semantics or meaning of the sentence. The term which contributes to the meaning/semantics of the sentence is further analyzed at sentence, document and corpus levels rather than traditional analysis of document only.