International Journal of Advanced Research in Computer Science and Software Engineering (IJARCSSE)
Data mining is the process of analyzing data from different perspectives and summarizing it into useful information. Data are any facts, numbers, or text that can be processed by a computer. The patterns, associations, and relationships among the collected data provide information, and that information can in turn be converted into knowledge about historical patterns and future trends. In this sense, data mining is the extraction of useful information from very large volumes of data, i.e., big data. This paper compares two data mining classification algorithms: the C4.5 decision tree and the Bayesian classifier.