Threads vs. Caches: Modeling the Behavior of Parallel Workloads
A new generation of high-performance engines combines graphics-oriented parallel processors with a cache architecture. In response to this trend, highly parallel workloads are being developed. However, it is often difficult to predict how a given application will perform on a given architecture. This paper presents a model that captures the behavior of such parallel workloads on different multi-core architectures. Specifically, the authors provide a simple analytical model that, for a given application, describes its performance and power as a function of the number of threads it runs in parallel, across a range of architectures.