Classification of Large Datasets Using Random Forest Algorithm in Various Applications: Survey

Provided by: International Journal of Engineering and Innovative Technology (IJEIT)
Topic: Data Management
Format: PDF
Random forest is an ensemble of classification algorithm widely used in much application especially with larger datasets because of its outstanding features like variable importance measure, OOB error detection, proximity among the feature and handling of imbalanced datasets. This paper discusses many applications which use random forest to classify the dataset like network intrusion detection, e-mail spam detection, gene classification, credit card fraud detection and text classification. In this paper, each application is briefly introduced and then the dataset used for implementation is discussed and finally the real implementation of random forest algorithm with steps wise procedure and also the results are discussed.

Find By Topic