Parsing of HTML Document
The Websites are an important source of data now days. There have been different types of information available on it. This information can be extremely beneficial for users. Extracting information from internet is challenging issue. However the amount of human interaction that is currently required for this is inconvenient. So, the objective of this paper is try to solve this problem by making the task as atomic as possible. Existing methods addressing the problem can be classified into three categories. Methods in the first category provide some languages to facilitate the construction of data extraction systems.