X-CSR: Dataflow Optimization for Distributed XML Process Pipelines

Download Now Free registration required

Executive Summary

XML process networks are a simple, yet powerful programming paradigm for loosely coupled, coarse-grained dataflow applications such as data-centric scientific workflows. This paper describes a framework called ? -XML that is well-suited for applications in which pipelines of data processors modify parts ("Deltas") of XML data collections while keeping the overall collection structure intact. This paper shows how to optimize the execution of ?-XML process networks by minimizing the data shipping cost in distributed settings.

  • Format: PDF
  • Size: 186.9 KB