International Journal of Computer Technology and Electronics Engineering
Web databases contain a huge amount of structured data that is readily accessible only through their query interfaces. Query results are presented in dynamically generated web pages, usually in the form of data records, for human use. Automatically extracting data records from query result pages is a crucial problem for web data integration applications such as comparison shopping sites and meta-search engines. A number of approaches to query result extraction have been proposed. As the structures of web pages become more complex, these approaches start to fail.