A Benchmark Suite for Unstructured Data Processing
Source: University of Virginia
A large fraction of the data that will stored and accessed in future systems is expected to be unstructured, in the form of images, audio files, etc. Therefore, it is very important to design future I/O subsystems to provide efficient storage, and access to these vast and continuously growing repositories of unstructured data. To facilitate system design and evaluation, the authors first need benchmarks that capture the processing and I/O access characteristics of applications that operate on unstructured data. In this paper, they present an unstructured data processing benchmark suite that they have developed.