Data Management

Starfish: A Selftuning System for Big Data Analytics

Download Now Date Added: Jan 2011
Format: PDF

Timely and cost-effective analytics over "Big Data" is now a key ingredient for success in many businesses, scientific and engineering disciplines, and government endeavors. The Hadoop software stack - which consists of an extensible MapReduce execution engine, pluggable distributed storage engines, and a range of procedural to declarative interfaces - is a popular choice for big data analytics. Most practitioners of big data analytics - like computational scientists, systems researchers, and business analysts - lack the expertise to tune the system to get good performance.