Data Management

Archiving Scientific Data

Free registration required

Executive Summary

Archiving is important for scientific data, where it is necessary to record all past versions of a database in order to verify findings based upon a specific version. Much scientific data is held in a hierarchical format and has a key structure that provides a canonical identification for each element of the hierarchy. In this paper, the authors exploit these properties to develop an archiving technique that is both efficient in its use of space and preserves the continuity of elements through versions of the database, something that is not provided by traditional minimum-edit-distance diff approaches. The approach also uses timestamps.

  • Format: PDF
  • Size: 490.2 KB