Extraction of Contextual Relevance of Web Documents
Crawled web pages should be organized in a form that machines can interpret, so that the results produced are meaningful and relevant. A set of web pages can be grouped by contextual sense if the crawler has a technique for understanding their meaning and identifying their domain. The contextual relevance of a web document can be determined by identifying the frequently occurring patterns of keywords in the page. This can be achieved with a data mining technique for generating frequent patterns, namely FP-Growth.
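
To illustrate the idea of mining frequent keyword patterns, the following is a minimal sketch of FP-Growth in Python. It is not the implementation used in this work. It assumes each page has already been reduced to a list of keywords; the sample pages, the function names (build_tree, fp_growth, frequent_keyword_patterns), and the support threshold are all hypothetical and chosen only for illustration.

```python
from collections import Counter, defaultdict


class Node:
    """One node of the FP-tree: an item, its count, and links up and down."""

    def __init__(self, item, parent):
        self.item = item
        self.parent = parent
        self.count = 0
        self.children = {}


def build_tree(transactions, min_support):
    """Build an FP-tree from (items, count) pairs.

    Returns a header table (item -> list of tree nodes) and the
    support of every item that meets min_support.
    """
    freq = Counter()
    for items, count in transactions:
        for item in set(items):
            freq[item] += count
    freq = {i: c for i, c in freq.items() if c >= min_support}

    root = Node(None, None)
    header = defaultdict(list)
    for items, count in transactions:
        # Keep frequent items only, most frequent first (ties broken by name).
        ordered = sorted((i for i in set(items) if i in freq),
                         key=lambda i: (-freq[i], i))
        node = root
        for item in ordered:
            child = node.children.get(item)
            if child is None:
                child = Node(item, node)
                node.children[item] = child
                header[item].append(child)
            child.count += count
            node = child
    return header, freq


def fp_growth(transactions, min_support, suffix=frozenset()):
    """Return {frozenset(pattern): support} for all frequent patterns."""
    header, freq = build_tree(transactions, min_support)
    patterns = {}
    for item, support in freq.items():
        pattern = suffix | {item}
        patterns[pattern] = support
        # Conditional pattern base: the prefix path above each node of `item`.
        conditional = []
        for node in header[item]:
            path = []
            parent = node.parent
            while parent is not None and parent.item is not None:
                path.append(parent.item)
                parent = parent.parent
            if path:
                conditional.append((path, node.count))
        # Recurse on the conditional base to grow longer patterns.
        patterns.update(fp_growth(conditional, min_support, pattern))
    return patterns


def frequent_keyword_patterns(pages, min_support):
    """pages: one keyword list per web page (hypothetical input format)."""
    return fp_growth([(keywords, 1) for keywords in pages], min_support)


if __name__ == "__main__":
    # Hypothetical keyword lists, one per crawled page.
    pages = [
        ["web", "crawler", "index"],
        ["web", "crawler", "search"],
        ["web", "mining", "pattern"],
        ["crawler", "web", "pattern"],
    ]
    result = frequent_keyword_patterns(pages, min_support=3)
    for pattern, support in sorted(result.items(), key=lambda kv: sorted(kv[0])):
        print(sorted(pattern), support)
```

With these four sample pages and a minimum support of 3, the only patterns that pass the threshold are "web", "crawler", and the pair "web" with "crawler". Keyword sets of this kind, which recur across many pages, are what could then be used as a signal of a shared context or domain.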