International Journal of Advanced Research in Computer Science and Software Engineering (IJARCSSE)
The data extraction techniques from web help to extract knowledge from web data, in which at least one of structured data is used in the mining process. The information collected from HTML tags and all sources, gives the minimum accuracy of result at the time of merging the collection of similar data into tables. This technique needs highest accuracy for analysis of data collection from the web. In web information extraction, data annotation and data alignment are the major problems that still need solutions.