Optimizing power efficiency for 3D stacked GPU-in-memory architecture

Cited by: 8
Authors
Wen, Wen [1]
Yang, Jun [1]
Zhang, Youtao [2]
Affiliations
[1] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA USA
[2] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA USA
Funding
National Science Foundation (USA)
Keywords
GPU; Stacked memory; NoC; Power efficiency; FUTURE;
DOI
10.1016/j.micpro.2017.01.005
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
With the prevalence of data-centric computing, the key to achieving energy efficiency is to reduce the latency and energy cost of data movement. Near data processing (NDP) is one such technique: instead of moving data around, it moves computation closer to where the data is stored. Emerging 3D stacked memory offers an opportunity to achieve both high power efficiency and lower data movement overhead. In this paper, we explore power-efficient NDP architectures built on 3D stacked memory. We integrate programmable GPU streaming multiprocessors into the NDP architectures in order to fully exploit the bandwidth provided by 3D stacked memory. In addition, we study the tradeoffs among area, performance, and power of the NDP components, especially the NoC designs. Our experimental results show that, compared to traditional architectures, the proposed GPU-based NDP architectures achieve up to a 43.8% reduction in energy-delay product (EDP) and a 41.9% improvement in power efficiency, measured as performance per Watt. © 2017 Elsevier B.V. All rights reserved.
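The two metrics reported in the abstract can be illustrated with a minimal Python sketch. It assumes the conventional definitions (EDP as energy times runtime, and performance per Watt as throughput divided by average power); the record itself does not state the exact formulas the authors used. All function names and input values below are hypothetical and are not taken from the paper.

```python
def energy_delay_product(avg_power_w: float, runtime_s: float) -> float:
    """EDP in joule-seconds: energy (power * time) multiplied by runtime."""
    energy_j = avg_power_w * runtime_s
    return energy_j * runtime_s


def perf_per_watt(work_units: float, runtime_s: float, avg_power_w: float) -> float:
    """Throughput (work per second) divided by average power in Watts."""
    return (work_units / runtime_s) / avg_power_w


def relative_reduction(baseline: float, proposed: float) -> float:
    """Fractional reduction of a lower-is-better metric such as EDP."""
    return (baseline - proposed) / baseline


def relative_improvement(baseline: float, proposed: float) -> float:
    """Fractional gain of a higher-is-better metric such as perf/Watt."""
    return (proposed - baseline) / baseline


if __name__ == "__main__":
    # Hypothetical measurements, chosen only to exercise the formulas.
    base_power, base_time = 100.0, 2.0   # Watts, seconds
    ndp_power, ndp_time = 80.0, 1.8      # Watts, seconds
    work = 1e9                           # arbitrary units of work

    edp_cut = relative_reduction(
        energy_delay_product(base_power, base_time),
        energy_delay_product(ndp_power, ndp_time),
    )
    ppw_gain = relative_improvement(
        perf_per_watt(work, base_time, base_power),
        perf_per_watt(work, ndp_time, ndp_power),
    )
    print(f"EDP reduction: {edp_cut:.1%}, perf/Watt improvement: {ppw_gain:.1%}")
```

Because EDP multiplies energy by runtime, a design that is both faster and lower in power is rewarded twice for the runtime gain, which is why an EDP reduction can differ from the performance-per-Watt improvement for the same pair of measurements.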
Pages: 44-53
Page count: 10