Transitive Closure on the Cell Broadband Engine: A study on Self-Scheduling in a Multicore Processor

被引:0
|
作者
Vinjamuri, Sudhir [1 ]
Prasanna, Viktor K. [1 ]
机构
[1] Univ So Calif, Dept Elect Engn, Los Angeles, CA 90007 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we present a mapping methodology and optimizations for solving transitive closure on the Cell multicore processor Using our approach, it is possible to achieve near peak performance for transitive closure on the Cell processor We first parallelize the standard Floyd Warshall algorithm and show through analysis and experimental results that data communication is a bottleneck for performance and scalability. We parallelize a cache optimized version of Floyd Warshall algorithm to remove the memory bottleneck. As is the case with several scientific computing and industrial applications on a multicore processor, synchronization and scheduling of the cores plays a crucial role in determining the performance of this algorithm. We define a self-scheduling mechanism for the cores of a multicore processor and design a self-scheduler for Blocked Floyd Warshall algorithm on the Cell multicore processor to remove the scheduling bottleneck. We also present optimizations in scheduling order to remove synchronization points. Our implementations achieved up to 78GFLOPS.
引用
收藏
页码:999 / 1009
页数:11
相关论文
共 50 条
  • [1] DYNAMIC PROCESSOR SELF-SCHEDULING FOR GENERAL PARALLEL NESTED LOOPS
    FANG, ZX
    TANG, PY
    YEW, PC
    ZHU, CQ
    IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (07) : 919 - 929
  • [2] Verification of the Cell Broadband Engine™ processor
    Shimizu, Kanna
    Gupta, Sanjay
    Koyama, Tatsuya
    Omizo, Takashi
    Abdulhafiz, Jamee
    McConville, Larry
    Swanson, Todd
    43RD DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2006, 2006, : 338 - +
  • [3] Cell broadband engine processor: Design and implementation
    Riley, M. W.
    Warnock, J. D.
    Wendel, D. F.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2007, 51 (05) : 545 - 557
  • [4] Cell broadband engine processor design methodology
    Takahashi, O.
    Behnen, E.
    Cottier, S. R.
    Coulman, P.
    Dhong, S. H.
    Flachs, B.
    Hofstee, P.
    Johnson, C. J.
    Posluszny, S.
    PROCEEDINGS OF THE IEEE 2007 CUSTOM INTEGRATED CIRCUITS CONFERENCE, 2007, : 711 - 716
  • [5] A Security Review of the Cell Broadband Engine Processor
    Smith, Jessica
    He, Xiaohui
    Alves-Foss, Jim
    43RD HAWAII INTERNATIONAL CONFERENCE ON SYSTEMS SCIENCES VOLS 1-5 (HICSS 2010), 2010, : 4417 - 4424
  • [6] Accelerating computing with the cell broadband engine processor
    Crawford, Catherine
    Henning, Paul
    Kistler, Michael
    Wright, Cornell
    2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 353 - 353
  • [7] A TRANSITIVE CLOSURE ALGORITHM FOR A 16-STATE CELL PROCESSOR
    KATONA, E
    LECTURE NOTES IN COMPUTER SCIENCE, 1989, 342 : 285 - 290
  • [8] Performance evaluation of enhancement of the layered self-scheduling approach for heterogeneous multicore cluster systems
    Chao-Chin Wu
    Lien-Fu Lai
    Liang-Tsung Huang
    MingLung Chen
    The Journal of Supercomputing, 2012, 62 : 399 - 430
  • [9] Automated patient self-scheduling: case study
    Woodcock, Elizabeth
    Sen, Aditi
    Weiner, Jonathan
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (09) : 1637 - 1641
  • [10] Performance evaluation of enhancement of the layered self-scheduling approach for heterogeneous multicore cluster systems
    Wu, Chao-Chin
    Lai, Lien-Fu
    Huang, Liang-Tsung
    Chen, MingLung
    JOURNAL OF SUPERCOMPUTING, 2012, 62 (01): : 399 - 430