Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads

Within the past few years, organizations in diverse industries have adopted MapReduce-based systems for large-scale data processing. Along with these new users, important new workloads have emerged which feature many small, short, and increasingly interactive jobs in addition to the large, long-running batch jobs for which MapReduce was originally designed. As interactive, large-scale query processing is strength of the RDBMS community, it is important that lessons from that field be carried over and applied where possible in this new domain. However, these new workloads have not yet been described in the literature.

Provided by: University of Calgary Topic: Data Management Date Added: Jun 2012 Format: PDF

Find By Topic