Optimizing Memory Performance for FPGA Implementation of PageRank

被引:0
|
作者
Zhou, Shijie [1 ]
Chelmis, Charalampos [1 ]
Prasanna, Viktor K. [1 ]
机构
[1] Univ Southern Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90007 USA
关键词
FPGA; PageRank; Memory performance;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, FPGA implementation of graph algorithms arising in many areas such as social networks has been studied. However, the irregular memory access pattern of graph algorithms makes obtaining high performance challenging. In this paper, we present an FPGA implementation of the classic PageRank algorithm. Our goal is to optimize the overall system performance, especially the cost of accessing the off-chip DRAM. We optimize the data layout so that most of memory accesses to the DRAM are sequential. Post-place-and-route results show that our design on a state-of-the-art FPGA can achieve a high clock rate of over 200 MHz. Based on a realistic DRAM access model, we build a simulator to estimate the execution time including memory access overheads. The simulation results show that our design achieves at least 96% of the theoretically best performance of the target platform. Compared with a baseline design, our optimized design dramatically reduces the number of random memory accesses and improves the execution time by at least 70%.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Optimizing FPGA Performance, Power, and Dependability with Linear Programming
    Wulf, Nicholas
    George, Alan D.
    Gordon-Ross, Ann
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2017, 10 (03)
  • [22] Optimizing the Performance of IoT Using FPGA as Compared to GPU
    Nair, Rajit
    Sharma, Preeti
    Sharma, Tripti
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2022, 14 (01)
  • [23] Towards an FPGA Solver for the PageRank Eigenvector Problem
    McGettrick, Seamas
    Geraghty, Dermot
    McElroy, Ciaran
    PARALLEL COMPUTING: ARCHITECTURES, ALGORITHMS AND APPLICATIONS, 2008, 15 : 793 - 800
  • [24] Optimizing the SUSAN corner detection algorithm for a high speed FPGA implementation
    Claus, Christopher
    Huitl, Robert
    Rausch, Joachim
    Stechele, Walter
    FPL: 2009 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, 2009, : 138 - 145
  • [25] A Holistic Approach for Optimizing DSP Block Utilization of a CNN implementation on FPGA
    Abdelouahab, Kamel
    Bourrasset, Cedric
    Pelcat, Maxime
    Berry, Francois
    Quinton, Jean-Charles
    Serot, Jocelyn
    ICDSC 2016: 10TH INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERA, 2016, : 69 - 75
  • [26] Stack Memory Implementation and Analysis of Timing Constraint, Power and Memory using FPGA
    Thind, Vandana
    Pandey, Nisha
    Pandey, Bishwajeet
    Hussain, D. M. Akbar
    2017 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2017, : 215 - 220
  • [27] Implementation and Performance Comparison of Digital Filter in FPGA
    Thesni, K.
    Praveen, K.
    Srivani, L.
    2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 589 - 594
  • [28] High performance FPGA implementation of the mersenne twister
    Chandrasekaran, Shrutisagar
    Amira, Abbes
    DELTA 2008: FOURTH IEEE INTERNATIONAL SYMPOSIUM ON ELECTRONIC DESIGN, TEST AND APPLICATIONS, PROCEEDINGS, 2008, : 482 - 485
  • [29] High performance IIR filter implementation on FPGA
    Debarshi Datta
    Himadri Sekhar Dutta
    Journal of Electrical Systems and Information Technology, 8 (1)
  • [30] FPGA implementation of high performance FIR filters
    Kollig, P
    AlHashimi, BM
    Abbott, KM
    ISCAS '97 - PROCEEDINGS OF 1997 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I - IV: CIRCUITS AND SYSTEMS IN THE INFORMATION AGE, 1997, : 2240 - 2243