Predicting Primary Tumors using Multiclass Classifier Approach of Data Mining

Data mining has been widely adopted in recent years in many fields, especially in the medical field. This paper highlights the prediction of unknown primary tumors in the dataset. The multiclass classifier with random forest is used for classification of multiclass dataset as it gives much higher accuracy than binary classifiers. SMOTE method for this imbalanced dataset with randomize technique is applied during preprocessing for reducing the biasness among classes. These all evaluations and results are carried out using Weka 3.6.10 as a data mining tool.

Provided by: International Journal of Computer Applications Topic: Big Data Date Added: Jun 2014 Format: PDF

Find By Topic