Multicore Surprises: Lessons Learned from Optimizing Sweep3D on the Cell Broadband Engine

Provided by: Institute of Electrical & Electronic Engineers
Topic: Hardware
Format: PDF
Over the last decade high-performance computing has ridden the wave of commodity computing, building cluster-based parallel computers that were able to leverage the tremendous growth in processor performance fueled by the commercial world. The Cell Broadband Engine (Cell BE) processor provides the potential to achieve an impressive level of performance for scientific applications. This level of performance can be reached by exploiting several dimensions of parallelism, such as thread-level parallelism using several synergistic processing elements, data streaming parallelism, vector parallelism in the form of 128-bit SIMD operations, and pipeline parallelism by issuing multiple instructions in the same clock cycle.

Find By Topic