Challenges in Genomic Data Processing I - Multiple Small Files

Executive Summary

Advances in genomics have made more data on the very building blocks of life available than ever before. Coupled with the burgeoning fields of bioinformatics and statistical genomics, these advances can reveal significant information that was previously hidden in those simple chemical bonds. Unfortunately, bridging the divide between the life sciences and information technology is not always straightforward. Data produced by genetic technology can be messy, and there is often a great deal of it. Despite increases in computing power and advances in natural language processing, order must still be brought to the chaos before the data can be analyzed.

