Power-efficient Computing for Compute-intensive GPGPU Applications

被引:0
|
作者
Gilani, Syed Zohaib [1 ]
Kim, Nam Sung [1 ]
Schulte, Michael J.
机构
[1] Univ Wisconsin, Madison, WI 53706 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The peak compute performance of GPUs has been increased by integrating more compute resources and operating them at higher frequency. However, such approaches significantly increase power consumption of GPUs, limiting further performance increase due to the power constraint. Facing such a challenge, we propose three techniques to improve power efficiency and performance of GPUs in this paper. First, we observe that many GPGPU applications are integer-intensive. For such applications, we combine a pair of dependent integer instructions into a composite instruction that can be executed by an enhanced fused multiply-add unit. Second, we observe that computations for many instructions are duplicated across multiple threads. We dynamically detect such instructions and execute them in a separate scalar unit. Finally, we observe that 16 or fewer bits are sufficient for accurate representation of operands and results of many instructions. Thus, we split the 32-bit datapath into two 16-bit datapath slices that can concurrently issue and execute up to two such instructions per cycle. All three proposed techniques can considerably increase utilization of compute resources, improving power efficiency and performance by 20% and 15%, respectively.
引用
收藏
页码:330 / 341
页数:12
相关论文
共 50 条
  • [21] Compute-intensive methods in artificial intelligence
    Selman, B
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2000, 28 (1-4) : 35 - 38
  • [22] A Heterogeneous System Architecture for Low-Power Wireless Sensor Nodes in Compute-intensive Distributed Applications
    Engel, Andreas
    Koch, Andreas
    Siebel, Thomas
    2015 IEEE 40TH LOCAL COMPUTER NETWORKS CONFERENCE WORKSHOPS (LCN WORKSHOPS), 2015, : 636 - 644
  • [23] Power-Efficient Server Utilization in Compute Clouds
    Lenhardt, Joerg
    Schiffmann, Wolfram
    Eitschberger, Patrick
    Keller, Joerg
    2013 THIRD BERKELEY SYMPOSIUM ON ENERGY EFFICIENT ELECTRONIC SYSTEMS (E3S), 2013,
  • [24] Performance evaluation of GPU- and cluster-computing for parallelization of compute-intensive tasks
    Doeschl, Alexander
    Keller, Max-Emanuel
    Mandl, Peter
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2021, 17 (04) : 377 - 402
  • [25] Indirect cube: A power-efficient topology for compute clusters
    Navaridas, Javier
    Miguel-Alonso, Jose
    OPTICAL SWITCHING AND NETWORKING, 2011, 8 (03) : 162 - 170
  • [26] The VuSystem: A programming system for compute-intensive multimedia
    Lindblad, CJ
    Tennenhouse, DL
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1996, 14 (07) : 1298 - 1313
  • [27] A Ferroelectric FET based Power-efficient Architecture for Data-intensive Computing
    Long, Yun
    Na, Taesik
    Rastogi, Prakshi
    Rao, Karthik
    Khan, Asif Islam
    Yalamanchili, Sudhakar
    Mukhopadhyay, Saibal
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) DIGEST OF TECHNICAL PAPERS, 2018,
  • [28] Pipeline parallelism with reduced network communication for efficient compute-intensive neural network training
    Yu, Chanhee
    Park, Kyongseok
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (05):
  • [29] Deployment of Run-Time Reconfigurable Hardware Coprocessors Into Compute-Intensive Embedded Applications
    Fons, Francisco
    Fons, Mariano
    Canto, Enrique
    Lopez, Mariano
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2012, 66 (02): : 191 - 221
  • [30] Efficient Compute-Intensive Job Allocation in Data Centers via Deep Reinforcement Learning
    Yi, Deliang
    Zhou, Xin
    Wen, Yonggang
    Tan, Rui
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (06) : 1474 - 1485