Fuzzy Ontology Based Schematic Web Crawling with Self Adaptive Clustering on Extraction

Provided by: Creative Commons
Topic: Data Management
Format: PDF
Because of the rapid growth in the number of web pages on the internet, discovering relevant content is one of the main challenges in deep web crawling. Web crawlers play a vital role in search engines. Most web crawlers follow only hyperlinks. However, HTML form URLs and JavaScript-based URLs in web pages can also lead to content relevant to the search keywords. Adding these URLs to the crawl increases the number of pages to be crawled, which in turn increases the crawling time.
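
The three URL sources named above (hyperlinks, HTML form URLs, and JavaScript-based URLs) can be illustrated with a minimal sketch using only the Python standard library. This is not the paper's method. The class name `LinkCollector`, the helper `collect_urls`, and the regular expression used to spot URLs inside script text are all assumptions made for illustration, and a real crawler would need a far more robust approach to JavaScript.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Assumed heuristic: quoted absolute or root-relative URLs inside script text.
JS_URL = re.compile(r"""["']((?:https?://|/)[^"'\s]+)["']""")


class LinkCollector(HTMLParser):
    """Hypothetical helper that separates URLs by where they were found."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.hyperlinks = set()   # from <a href="...">
        self.form_urls = set()    # from <form action="...">
        self.script_urls = set()  # from string literals in <script> bodies
        self._in_script = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            self.hyperlinks.add(urljoin(self.base_url, attrs["href"]))
        elif tag == "form" and attrs.get("action"):
            self.form_urls.add(urljoin(self.base_url, attrs["action"]))
        elif tag == "script":
            self._in_script = True

    def handle_endtag(self, tag):
        if tag == "script":
            self._in_script = False

    def handle_data(self, data):
        # Only scan text that appears inside a <script> element.
        if self._in_script:
            for match in JS_URL.findall(data):
                self.script_urls.add(urljoin(self.base_url, match))


def collect_urls(html, base_url):
    """Parse one page and return the collector holding the three URL sets."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser
```

Comparing the size of `hyperlinks` with the union of all three sets on a given page gives a rough sense of how much the crawl frontier grows when form and script URLs are included, which is the cost in crawling time that the abstract points to.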