A Study in Hadoop Streaming with Matlab for NMR data processing

Date Added: Apr 2010
Format: PDF

Applying Cloud computing techniques to the analysis of large data sets has shown promise in many data-driven scientific applications. The approach presented here uses Cloud computing for Nuclear Magnetic Resonance (NMR) data analysis, which normally involves large amounts of data. Biologists often rely on third-party or commercial software for ease of use, so the ability to run this kind of software in a Cloud would be highly advantageous. Scripting languages designed specifically for Clouds may not offer the flexibility biologists need. Biologists are, however, familiar with specialized software packages that let them express complex calculations with minimal effort, but these packages are often not compatible with a Cloud environment.