NGS Workflow Optimization Using a Hybrid Cloud Infrastructure
E-science applications involve great deal of data, to satisfy these processing requests, distributed computing paradigms, such as cluster, Grid, Virtual Grid, Cloud Computing, and Hybrid System are growing exponentially. Existing computing infrastructures, software system design, and use cases have to take into account the enormity in volume of requests, size of data and computing load. In Bioinformatics field, such as in Next Generation Sequencing technology, in order to have more accurate analysis, it increases the amount of data to process. A new protocol for sequencing the messenger RNA in a cell, known as RNA-Seq, produces millions of short sequence fragments in a single run. These fragments can be used to measure levels of gene expression and to identify novel splice variants of genes.