A Novel Parallel QR Algorithm for Hybrid Distributed Memory HPC Systems

Provided by: ETH Zurich
Topic: Hardware
Format: PDF
A novel variant of the parallel QR algorithm for solving dense nonsymmetric eigenvalue problems on hybrid distributed High Performance Computing (HPC) systems is presented. To this end, the authors introduce the concept of multi-window bulge chain chasing and parallelize aggressive early deflation. The multi-window approach ensures that most of the computation in chasing chains of bulges is performed by level 3 BLAS operations, while aggressive early deflation aims to speed up the convergence of the QR algorithm. Mixed MPI-OpenMP programming is used to port the code to distributed memory platforms whose nodes are multithreaded, for example multicore processors.
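The level 3 BLAS point can be illustrated with a small sketch. It is not the authors' code: it assumes the usual idea that the orthogonal transformations generated inside a window are accumulated into one small matrix and then applied to the rest of the data with a single matrix-matrix product. Plane rotations on one side of a block stand in for the Householder reflectors and two-sided updates of a real bulge chase, and every name (`chase_one_by_one`, `chase_accumulated`, `matmul`) is hypothetical.

```python
import math


def identity(n):
    """Return the n-by-n identity as a list of rows."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def matmul(a, b):
    """Plain triple-loop product; stands in for a level 3 BLAS GEMM call."""
    inner, cols = len(b), len(b[0])
    return [[sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for row in a]


def apply_rotation(m, i, theta):
    """Rotate rows i and i+1 of m in place (a vector-level update)."""
    c, s = math.cos(theta), math.sin(theta)
    for j in range(len(m[0])):
        x, y = m[i][j], m[i + 1][j]
        m[i][j] = c * x - s * y
        m[i + 1][j] = s * x + c * y


def chase_one_by_one(block, rotations):
    """Apply every rotation directly to the block, one after another."""
    out = [row[:] for row in block]
    for i, theta in rotations:
        apply_rotation(out, i, theta)
    return out


def chase_accumulated(block, rotations):
    """Accumulate the rotations into U first, then update the block once."""
    u = identity(len(block))
    for i, theta in rotations:
        apply_rotation(u, i, theta)  # U <- G * U, work stays in the small window
    return matmul(u, block)          # one matrix-matrix product


def max_abs_diff(a, b):
    """Largest entrywise difference between two equally sized matrices."""
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


# Illustrative data only: a 4-by-3 block and four arbitrary rotations.
block = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0], [1.5, -2.5, 0.5]]
rotations = [(0, 0.3), (1, -0.7), (2, 1.1), (0, 0.5)]
difference = max_abs_diff(chase_one_by_one(block, rotations),
                          chase_accumulated(block, rotations))
```

Both routes compute the same product of rotations times the block, so `difference` is at the level of rounding error. The accumulated route simply moves almost all of the arithmetic into one dense product, which is the kind of operation level 3 BLAS routines are designed to execute efficiently.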