DEUDS: Data Extraction Using DOM Tree and Selectors

Download Now
Provided by: Creative Commons
Topic: Big Data
Format: PDF
Web data analysis applications such as extracting mutual funds information from a website, daily extracting opening and closing price of stock from a web page involves web data extraction. Every time the users need analyze data, they need to visit number of web sites. It is very time consuming process to construct wrapper to visit those sites and collect data. In this paper, the authors propose technique called DEUDS, a page level data extraction system that automatically discovers extraction pattern from web pages for selected data section and extracts data.
Download Now

Find By Topic