Web Content Extraction to Facilitate Web Mining
Internet continuously strives to become the prime source of knowledge and Information, used in almost every sphere of life. As the volume and complexity of the Information shared on WEB is increasing, various forms of representation of this data has been emerged. In order to deal with different forms of data, different technologies have been discovered to efficiently provide the Information to the end users. With advent of such technologies the web content is reforming from simple HTML pages to highly complex, sophisticated bunch of data representation. A web page typically contains a mixture of many kind of information e.g., main contains, advertisements, navigational panels, copyright blocks etc.