Application-Aware Memory System for Fair and Efficient Execution of Concurrent GPGPU Applications
The available computing resources in modern GPUs are growing with each new generation. However, as many general purpose applications with limited thread-scalability are tuned to take advantage of GPUs, available compute resources might not be optimally utilized. To address this, modern GPUs will need to execute multiple kernels simultaneously. As current generations of GPUs (e.g., NVIDIA Kepler, AMD Radeon) already enable concurrent execution of kernels from the same application, in this paper, the authors address the next logical step: executing multiple concurrent applications in GPUs.