Transitive Closure on the Cell Broadband Engine: A study on Self-Scheduling in a Multicore Processor

Cited by: 0
Authors
Vinjamuri, Sudhir [1]
Prasanna, Viktor K. [1]
Affiliations
[1] Univ So Calif, Dept Elect Engn, Los Angeles, CA 90007 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
In this paper we present a mapping methodology and optimizations for solving transitive closure on the Cell multicore processor. Using our approach, it is possible to achieve near-peak performance for transitive closure on the Cell processor. We first parallelize the standard Floyd-Warshall algorithm and show through analysis and experimental results that data communication is a bottleneck for performance and scalability. We then parallelize a cache-optimized version of the Floyd-Warshall algorithm to remove the memory bottleneck. As is the case with several scientific computing and industrial applications on a multicore processor, synchronization and scheduling of the cores play a crucial role in determining the performance of this algorithm. We define a self-scheduling mechanism for the cores of a multicore processor and design a self-scheduler for the Blocked Floyd-Warshall algorithm on the Cell multicore processor to remove the scheduling bottleneck. We also present optimizations in scheduling order to remove synchronization points. Our implementations achieved up to 78 GFLOPS.
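To make the two algorithm variants named in the abstract concrete, the following is a minimal sequential sketch, not the authors' Cell implementation. It assumes a boolean adjacency matrix stored as a list of lists and uses one common three-phase tiling of Floyd-Warshall; the function names (`transitive_closure`, `blocked_transitive_closure`, `_update_tile`) and the tile size parameter `b` are hypothetical, chosen for illustration. It does not model SPE parallelism, DMA transfers, or the self-scheduler described above.

```python
def transitive_closure(adj):
    """Standard Floyd-Warshall (Warshall) transitive closure on a boolean matrix."""
    n = len(adj)
    reach = [row[:] for row in adj]  # copy so the input is not modified
    for k in range(n):
        for i in range(n):
            if reach[i][k]:
                for j in range(n):
                    if reach[k][j]:
                        reach[i][j] = True
    return reach


def _update_tile(reach, i0, j0, k0, b, n):
    """Update tile (i0, j0) using intermediate vertices k0 .. k0 + b - 1."""
    for k in range(k0, min(k0 + b, n)):
        for i in range(i0, min(i0 + b, n)):
            if reach[i][k]:
                for j in range(j0, min(j0 + b, n)):
                    if reach[k][j]:
                        reach[i][j] = True


def blocked_transitive_closure(adj, b):
    """Tiled variant: each round handles one diagonal tile in three phases."""
    n = len(adj)
    reach = [row[:] for row in adj]
    starts = range(0, n, b)
    for k0 in starts:
        # Phase 1: the diagonal tile depends only on itself.
        _update_tile(reach, k0, k0, k0, b, n)
        # Phase 2: tiles in the same tile-row and tile-column as the diagonal tile.
        for t in starts:
            if t != k0:
                _update_tile(reach, k0, t, k0, b, n)
                _update_tile(reach, t, k0, k0, b, n)
        # Phase 3: all remaining tiles; these are mutually independent,
        # which is where a parallel scheduler could distribute work.
        for i0 in starts:
            for j0 in starts:
                if i0 != k0 and j0 != k0:
                    _update_tile(reach, i0, j0, k0, b, n)
    return reach


if __name__ == "__main__":
    # Small example: a chain 0 -> 1 -> 2 -> 3 plus an isolated vertex 4.
    n = 5
    adj = [[False] * n for _ in range(n)]
    for u, v in [(0, 1), (1, 2), (2, 3)]:
        adj[u][v] = True
    print(blocked_transitive_closure(adj, 2) == transitive_closure(adj))
```

In this sketch the tiles of phase 3 within one round do not depend on each other, so they are the natural unit of work to hand out to cores; the boundaries between phases are the synchronization points that a scheduling-order optimization would aim to reduce.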
Pages: 999-1009
Page count: 11
Related papers
50 in total
  • [21] Video analysis and compression on the STI cell broadband engine processor
    Liu, Lurng-Kuo
    Kesavarapu, Sreeni
    Connell, Jonathan
    Jagmohan, Ashish
    Leem, Lark-Boon
    Paulovicks, Brent
    Sheinin, Vadim
    Tang, Lijung
    Yeo, Hangu
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 29 - +
  • [22] CellSs: Making it easier to program the cell broadband engine processor
    Perez, J. M.
    Bellens, P.
    Badia, R. M.
    Labarta, J.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2007, 51 (05) : 593 - 604
  • [23] A Resonant Global Clock Distribution for the Cell Broadband Engine Processor
    Chan, Steven C.
    Restle, Phillip J.
    Bucelot, Thomas J.
    Liberty, John S.
    Weitzel, Stephen
    Keaty, John M.
    Flachs, Brian
    Volant, Richard
    Kapusta, Peter
    Zimmerman, Jeffrey S.
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2009, 44 (01) : 64 - 72
  • [24] PASSIVE COHERENT LOCATOR SIGNAL PROCESSOR ON IBM CELL BROADBAND ENGINE (CELL BE)
    Cantini, C.
    La Rosa, E.
    Lo Re, A.
    Di Lallo, A.
    2009 IEEE RADAR CONFERENCE, VOLS 1 AND 2, 2009, : 436 - +
  • [25] Massive video-surveillance parallelization on the Cell Broadband Engine processor
    Rabie, T.
    Kidwai, H. K.
    Sibai, F. N.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2010, 54 (06)
  • [26] High performance combinatorial algorithm design on the Cell Broadband Engine processor
    Bader, David A.
    Agarwal, Virat
    Madduri, Kamesh
    Kang, Seunghwa
    PARALLEL COMPUTING, 2007, 33 (10-11) : 720 - 740
  • [27] Real-time computed tomography on the cell broadband engine processor
    Bockenbach, Olivier
    Knaup, Michael
    Kachelriess, Marc
    IMAGE PROCESSING: ALGORITHMS AND SYSTEMS VI, 2008, 6812
  • [28] Automating the self-scheduling process of nurses in Swedish healthcare: a pilot study
    Elina Rönnberg
    Torbjörn Larsson
    Health Care Management Science, 2010, 13 : 35 - 53
  • [29] A Constraint Programming Approach for Allocation and Scheduling on the CELL Broadband Engine
    Benini, Luca
    Lombardi, Michele
    Milano, Michela
    Ruggiero, Martino
    PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, 2008, 5202 : 21 - 35
  • [30] Automating the self-scheduling process of nurses in Swedish healthcare: a pilot study
    Ronnberg, Elina
    Larsson, Torbjorn
    HEALTH CARE MANAGEMENT SCIENCE, 2010, 13 (01) : 35 - 53