Taking the OXPath Down the Deep Web
Although deep web analysis has been studied extensively, there is no succinct formalism to describe user interactions with AJAX-enabled web applications. Toward this end, the authors introduce OXPath as a superset of XPath 1.0. Beyond XPath, OXPath is able to fill web forms and trigger DOM events, to access dynamically computed CSS attributes, to navigate between visible form fields, and to mark relevant information for extraction. This way, OXPath expressions can closely simulate the human interaction relevant for navigation rather than rely exclusively on the HTML structure. Thus, they are quite resilient against technical changes.