The circuits and robust design methodology of the massively parallel processor based on the matrix architecture

被引:3
|
作者
Noda, Hideyuki [1 ]
Tanizaki, Tetsushi [1 ]
Gyohten, Takayuki [1 ]
Dosaka, Katsumi [1 ]
Nakajima, Masami [1 ]
Mizumoto, Katsuya [1 ]
Yoshida, Kanako [1 ]
Iwao, Takenobu [1 ]
Nishijima, Tetsu [1 ]
Okuno, Yoshihiro [1 ]
Arimoto, Kazutami [1 ]
机构
[1] Renesas Technol Corp, Itami, Hyogo 6640005, Japan
关键词
CMOS; integrated circuits; low power; memory; parallel processor; SIMD;
D O I
10.1109/JSSC.2007.891680
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Novel circuits and design methodology of the massively parallel processor based on the matrix architecture are introduced. A fine-grained processing elements (PE) circuit for high-throughput MAC operations based on the Booth's algorithm enhances the performance of a 16-bit fixed-point signed MAC, which operates up to 30.0 GOPS/W. The dedicated I/O interface circuits are designed for converting the direction of data access and supporting the interleaved memory architecture, and they are implemented for maximizing the processor core efficiency. Power management techniques for suppressing current peaks and reducing average power consumption are proposed to enhance the robustness of the macro. The circuits and the design methodology proposal in this paper are attractive for achieving a high performance and robust massively parallel SIMD processor core employed in multimedia SoCs.
引用
收藏
页码:804 / 812
页数:9
相关论文
共 50 条
  • [21] Gradient method based design methodology for time and area optimization of a pipelined attached processor architecture
    Jagannath, KR
    Gibson, GA
    FOURTH INTERNATIONAL CONFERENCE ON HIGH-PERFORMANCE COMPUTING, PROCEEDINGS, 1997, : 272 - 276
  • [22] Democrat: A design methodology for the conception of robot with parallel architecture
    Merlet, JP
    IROS '97 - PROCEEDINGS OF THE 1997 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOT AND SYSTEMS: INNOVATIVE ROBOTICS FOR REAL-WORLD APPLICATIONS, VOLS 1-3, 1996, : 1630 - 1636
  • [23] Design methodology of regular logic bricks for robust integrated circuits
    Tong, Kim Yaw
    Rovner, Vyacheslav
    Pileggi, Lawrence T.
    Kheterpal, Veerbhan
    PROCEEDINGS 2006 INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, 2007, : 162 - +
  • [24] Design methodology for a modular service-driven network processor architecture
    Gabrani, M
    Dittmann, G
    Döring, A
    Herkersdorf, A
    Sagmeister, P
    van Lunteren, J
    COMPUTER NETWORKS, 2003, 41 (05) : 623 - 640
  • [25] A prefetch architecture design based on graphics processor compression architecture
    Zhao S.
    Zhang L.
    Zhang L.
    High Technology Letters, 2022, 32 (04) : 351 - 357
  • [26] Design of a tokenless architecture for parallel computations using associative dataflow processor
    Jamil, T
    Deshmukh, RG
    PROCEEDINGS OF THE IEEE SOUTHEASTCON '96: BRINGING TOGETHER EDUCATION, SCIENCE AND TECHNOLOGY, 1996, : 649 - 656
  • [27] DESIGN ISSUES AND AN ARCHITECTURE FOR THE MONOLITHIC IMPLEMENTATION OF A PARALLEL DIGITAL SIGNAL PROCESSOR
    FELLMAN, RD
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (05): : 839 - 852
  • [28] A Game-of-Life-based Paradigm for Massively Parallel Computing on Asynchronous Circuits
    Ye, Ya-Hui
    Lee, Jia
    Zhu, Li-Wei
    JOURNAL OF CELLULAR AUTOMATA, 2018, 13 (04) : 287 - 305
  • [29] A pipelined processor suitable for a bus-based parallel architecture
    Arad, BS
    Shih, HR
    COMPUTERS AND THEIR APPLICATIONS, 2001, : 485 - 488
  • [30] Improving matrix-based dynamic programming on massively parallel accelerators
    Bednarek, David
    Brabec, Michal
    Krulis, Martin
    INFORMATION SYSTEMS, 2017, 64 : 175 - 193