Main Content Extraction From Web Page Using Dom

Today internet has made the life of human dependent on it. Almost everything and anything can be searched on net. The rapid growth of World Wide Web has been tremendous in recent years. With the large amount of information on the Internet, web pages have been the potential source of information retrieval and data mining technology such as commercial search engines, web mining applications. Internet web pages contain several items that cannot be classified as the informative content, e.g., search and filtering panel, navigation links, advertisements, and so on called as noisy parts.

Provided by: International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE) Topic: Big Data Date Added: Mar 2014 Format: PDF

Find By Topic