International forum of researchers Students and Academician
Text documents in the form of digital data are rapidly increasing. Manually analyzing such data is a tiresome task. Data mining techniques have been considered to analyze such data and generate interesting patterns. Many existing techniques use term-based methods. Some term-based approaches are Rocchio and probabilistic models, BM25, rough set models and Support Vector Machine (SVM). The advantages of term-based methods are that they include efficient computational performance and term weighting, which have developed over the last decade.