Starfish: A Selftuning System for Big Data Analytics

Timely and cost-effective analytics over "Big Data" is now a key ingredient for success in many businesses, scientific and engineering disciplines, and government endeavors. The Hadoop software stack - which consists of an extensible MapReduce execution engine, pluggable distributed storage engines, and a range of procedural to declarative interfaces - is a popular choice for big data analytics. Most practitioners of big data analytics - like computational scientists, systems researchers, and business analysts - lack the expertise to tune the system to get good performance.

Provided by: Duke University Topic: Data Management Date Added: Jan 2011 Format: PDF

Find By Topic