*Result*: Using advanced compiler technology to exploit the performance of the Cell Broadband Engine™ architecture.

Title:
Using advanced compiler technology to exploit the performance of the Cell Broadband Engine™ architecture.
Authors:
Eichenberger, Alexandre E.1 alexe@us.ibm.com, O'Brien, John Kevin2 caomhin@us.ibm.com, O'Brien, Kathryn M.2 kmob@us.ibm.com, Wu, Peng2 pengwu@us.ibm.com, Chen, Tong2 chentong@us.ibm.com, Oden, Peter H.2 oden@us.ibm.com, Prener, D. A.3 prener@us.ibm.com, Shepherd, Janice C.4 janshep@us.ibm.com, So, Byoungro2 bso@us.ibm.com, Sura, Zehra1 zsura@us.ibm.com, Wang, Amy5 aktwang@ca.ibm.com, Zhang, Tao6 zhangtao@cc.gatech.edu, Zhao, Peng5 pengz@ca.ibm.com, Gschwind, Michael K.2 mkg@us.ibm.com, Archambault, Roch7 archie@ca.ibm.com, Gao, Yaoqing7 ygao@ca.ibm.com, Koo, Roland8 rkoo@ca.ibm.com
Source:
IBM Systems Journal. 2006, Vol. 45 Issue 1, p59-84. 26p.
Database:
Academic Search Index

*Further Information*

*The continuing importance of game applications and other numerically intensive workloads has generated an upsurge in novel computer architectures tailored for such functionality. Game applications feature highly parallel code for functions such as game physics, which have high computation and memory requirements, and scalar code for functions such as game artificial intelligence, for which fast response times and a full-featured programming environment are critical. The Cell Broadband Engine™ architecture targets such applications, providing both flexibility and high performance by utilizing a 64-bit multithreaded PowerPC® processor element (PPE) with two levels of globally coherent cache and eight synergistic processor elements (SPEs), each consisting of a processor designed for streaming workloads, a local memory, and a globally coherent DMA (direct memory access) engine. Growth in processor complexity is driving a parallel need for sophisticated compiler technology. In this paper, we present a variety of compiler techniques designed to exploit the performance potential of the SPEs and to enable the multilevel heterogeneous parallelism found in the Cell Broadband Engine architecture. Our goal in developing this compiler has been to enhance programmability while continuing to provide high performance. We review the Cell Broadband Engine architecture and present the results of our compiler techniques, including SPE optimization, automatic code generation, single source parallelization, and partitioning. [ABSTRACT FROM AUTHOR]*