Performance Evaluation of LU Factorization Through Hardware Counter Measurements

Download Now
Provided by: University of Tehran
Topic: Storage
Format: PDF
The growing demand for scalable and effective scientific and numerical libraries on multicore architectures forces hardware manufacturers to design solutions that improve both the processor speed and transfer rates between their memory hierarchies. Several studies show that these improvement factors are disproportionate and may vary widely from one architecture to another and then have a strong impact on the tuning and the performance prediction of numerical libraries. In this paper, the authors analyze the communication and performance of some routines in well known libraries on different architectures and they establish a relation model between hardware parameters and performance.
Download Now

Find By Topic