International Journal of Advanced Research in Computer Science and Software Engineering (IJARCSSE)
The information on the web is growing dramatically and it is well known that over 80% of the time required to carry out any real world data mining project is usually spent on data pre-processing. Data pre-processing lays the groundwork for data mining. Before the discovery of useful information/knowledge, the target data set must be properly prepared. But it is unfortunately ignored by most researchers on data mining due to its perceived difficulty. This paper describes an efficient approach for data pre-processing for mining Web based user data in order to speed up the data preparation process.