Date Added: Feb 2012
The World Wide Web has huge amount of information that is retrieved using information retrieval tool like Search Engine. Page repository of Search Engine contains the web documents downloaded by the crawler. This repository contains variety of web documents from different domains. It becomes tedious for the user to manually extract real required information from this material. The detection of common and distinctive topics within a document set, together with the generation of multi-document summaries, can greatly ease the burden of information management. Cluster analysis divides objects into meaningful groups based on similarity between objects.