Structure and Semantics of Data-Intensive Web Pages: An Experimental Study on their Relationships

Download Now
Provided by: Journal of Universal Computer Science
Topic: Big Data
Format: PDF
In data-intensive web sites pages are generated by scripts that embed data from a backend database into HTML templates. There is usually a relationship between the semantics of the data in a page and its corresponding template. For example, in a web site about sports events, it is likely that pages with data about athletes are associated with a template that differs from the template used to generate pages about coaches or referees. This paper presents a method to classify web pages according to the associated template.
Download Now

Find By Topic