Dimensionality Reduction in Web Page Classification
The Internet returns millions of web pages for almost any search term. It is a powerful medium for communication between computers and for accessing online documents, and tools such as search engines help users locate and organize that information. Web page classification is one of the essential techniques in web mining, because identifying the web pages that belong to a class of interest is often the first step in mining the web. It is generally treated as a supervised learning problem. Web page classification suffers from high dimensionality: each page is described by a very large number of features. This is the major problem, because running any classification algorithm on such data requires substantial computational power and a large amount of memory.
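To make the cost of high dimensionality concrete, the following is a minimal sketch that assumes a simple bag-of-words representation, in which every distinct term becomes one feature. The text above does not specify a representation, so this choice, the toy pages, the helper names, and the corpus sizes in the last line are all illustrative assumptions rather than the method described here.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_vocabulary(pages):
    """Map each distinct term across all pages to a column index."""
    vocab = {}
    for page in pages:
        for term in tokenize(page):
            if term not in vocab:
                vocab[term] = len(vocab)
    return vocab


def dense_matrix_bytes(n_pages, n_terms, bytes_per_value=8):
    """Bytes needed to store a dense pages-by-terms matrix of 8-byte values."""
    return n_pages * n_terms * bytes_per_value


# Toy corpus, made up for illustration only.
pages = [
    "Cheap flights and hotel deals",
    "Latest football scores and match reports",
    "Hotel reviews and travel deals",
]

vocab = build_vocabulary(pages)
print(len(vocab), "features for", len(pages), "pages")

# Hypothetical corpus size: one million pages, 100,000 distinct terms.
print(dense_matrix_bytes(1_000_000, 100_000), "bytes for a dense matrix")
```

Even three short pages already produce 12 features, and the number of features keeps growing as new pages introduce new terms. Under the assumed corpus size, a dense matrix would need 800,000,000,000 bytes (about 800 GB), which is why reducing the number of dimensions before classification matters.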