Compiler and Runtime Support for Enabling Generalized Reduction Computations on Heterogeneous Parallel Configurations
A trend that has materialized and attracted much attention is the increasing heterogeneity of computing platforms. It is now very common for a desktop or notebook computer to come equipped with both a multi-core CPU and a GPU. Exploiting the full computational power of such architectures (i.e., using the multi-core CPU and the GPU simultaneously) starting from a high-level API is a critical challenge. The authors believe that it would be highly desirable to give programmers a simple way to realize the full potential of today's heterogeneous machines.