Data Management

Starfish: A Selftuning System for Big Data Analytics

Free registration required

Executive Summary

Timely and cost-effective analytics over "Big Data" is now a key ingredient for success in many businesses, scientific and engineering disciplines, and government endeavors. The Hadoop software stack - which consists of an extensible MapReduce execution engine, pluggable distributed storage engines, and a range of procedural to declarative interfaces - is a popular choice for big data analytics. Most practitioners of big data analytics - like computational scientists, systems researchers, and business analysts - lack the expertise to tune the system to get good performance.

  • Format: PDF
  • Size: 1781.9 KB