Efficient Performance Evaluation of Memory Hierarchy for Highly Multithreaded Graphics Processors

Provided by: Association for Computing Machinery
Topic: Storage
Format: PDF
With the emergence of highly multithreaded architectures, performance monitoring techniques face new challenges in efficiently locating sources of performance discrepancies in the program source code. For example, the state-of-the-art performance counters in highly multithreaded Graphics Processing Units (GPUs) report only the overall occurrences of microarchitecture events at the end of program execution. Furthermore, even where it is supported, fine-grained sampling of performance counters distorts the actual program behavior and makes the sampled values inaccurate. On the other hand, it is difficult to obtain high-resolution performance information at low sampling rates in the presence of thousands of concurrently running threads.