Data Centers

Designing Multi-Leader-Based Allgather Algorithms for Multi-Core Clusters

Free registration required

Executive Summary

The increasing demand for computational cycles is being met by the use of multi-core processors. Having large number of cores per node necessitates multi-core aware designs to extract the best performance. The Message Passing Interface (MPI) is the dominant parallel programming model on modern high performance computing clusters. The MPI collective operations take a significant portion of the communication time for an application. The existing optimizations for collectives exploit shared memory for intra-node communication to improve performance.

  • Format: PDF
  • Size: 163.3 KB