Science & Engineering Research Support soCiety (SERSC)
Recently, XML (eXtensible Markup Language) is widely used as the de facto standard for data representation and exchanging in Internet. In 2006, office application groups such as OpenOffice.org and Microsoft office both adopted XML as the main data storage format. Historical XML documents often have tiny differences between versions, but are stored individual independent space, so the abilities for efficient storing historical office documents are become a growing issue. This paper introduces an efficient way to decompose multi-version XML documents and store effectively for advanced retrieving.