Provided by: International Journal of Computer Applications
Topic: Big Data
Date Added: Jun 2014
Data mining has been widely adopted in recent years in many fields, especially in the medical field. This paper highlights the prediction of unknown primary tumors in the dataset. The multiclass classifier with random forest is used for classification of multiclass dataset as it gives much higher accuracy than binary classifiers. SMOTE method for this imbalanced dataset with randomize technique is applied during preprocessing for reducing the biasness among classes. These all evaluations and results are carried out using Weka 3.6.10 as a data mining tool.