Exploring the Hidden Web: A Review
World Wide Web (WWW) is broadly divided into two categories. First is Surface web that contains 1% of information content of the web. Search engine crawl along this web to extract and index text from HTML documents on the websites, then make this information searchable through keywords. Second is Hidden web that contains 99% of information content of the web. Most of this information is contained in the backend databases and is not indexed by search engines. Thus, users are searching only through 1% of Web. However, these hidden web pages are dynamically created through search query interfaces.