International Journal of Research In Advanced Engineering Technologies (IJRAET)
Extracting information from web documents is a very hard task, and is going to become more and more critical as the amount of digital information available on the internet grows. Indeed, documents are often so large that the dataset returned as answer to a query may be too big to convey interpretable knowledge. As the maintenance of several databases is a difficult task. In this paper, the authors are storing their web documents in an XML format and are easy to search for a query.