Networking

Compressing Provenance Graphs

Free registration required

Executive Summary

The provenance community has built a number of systems to collect provenance, most of which assume that provenance will be retained indefinitely. However, it is not cost-effective to retain provenance information inefficiently. Since provenance can be viewed as a graph, the authors note the similarities to web graphs and draw upon techniques from the web compression domain to provide their own novel and improved graph compression solutions for provenance graphs. Their preliminary results show that adapting web compression techniques results in a compression ratio of 2.12:1 to 2.71:1, which they can improve upon to reach ratios of up to 3.31:1.

  • Format: PDF
  • Size: 205.1 KB