Provided by: International Journal of Engineering Trends and Technology
Topic: Big Data
In data mining, an important goal is to generate efficient data. To analyze data efficiently data mining uses horizontal tabular layout. There are three fundamental methods to evaluate horizontal layout, they are SPJ, CASE and PIVOT which has its own advantages. Preparing datasets in data mining requires many SQL queries for joining tables and aggregated columns. In Data mining projects, Classification is one of the most significant tasks which consumes more time. This paper presents an efficient implementation technique for SQL by using optimized C4.5 algorithm to perform classification. By using optimized C4.5 algorithm the authors can handle high dimensional records with minimal time.