Power aeware framework for dense matrix operations in multimedia processors

被引:0
|
作者
Azeemi, N. Zafar [1 ]
机构
[1] Vienna Univ Technol, Inst Commun & Radio Frequency Engn, Christian Doppler Lab Design Methodol Signal Proc, A-1040 Vienna, Austria
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we analyze(1) the use of Decision Tree Grafting, Blocking and Loop Unfolding to improve the performance of dense matrix computations on high performance multimedia processors. The analysis focuses on the practical aspects that can be observed with multilayered memory levels. The problem is studies on the Philips Nexperia processor. The experimental evaluation of the proposed approach results into better exploitation of functional units, memory hierarchy and highway usage of the target processor. The advantages of the proposed interactive code transformation approach are two-folds. First, effort in optimization is spent only when the program measurement (transformation cost) determines that the effort is necessary and potentially beneficial, and only on those portions of the program where the energy/cycle performance payoff appears to be high. Second, by concatenating subsequent energy/cycle profile-driven low level transformations for higher level manipulations, the system will provide the programmer with a powerful toolset. The approach is illustrated using functional unit usage within a VLIW architecture for low power, which improves energy dissipation up to 34% and CPU performance up to 87% for an idct example.
引用
收藏
页码:718 / 723
页数:6
相关论文
共 50 条
  • [21] DVFS-control techniques for dense linear algebra operations on multi-core processors
    Alonso, Pedro
    Dolz, Manuel F.
    Igual, Francisco D.
    Mayo, Rafael
    Quintana-Orti, Enrique S.
    COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2012, 27 (04): : 289 - 298
  • [22] Using non-canonical array layouts in dense matrix operations
    Herrero, Jose R.
    Navarro, Juan J.
    APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2007, 4699 : 580 - +
  • [23] A performance model of dense matrix operations on many-core architectures
    Long, Guoping
    Fan, Dongrui
    Zhang, Junchao
    Song, Fenglong
    Yuan, Nan
    Lin, Wei
    EURO-PAR 2008 PARALLEL PROCESSING, PROCEEDINGS, 2008, 5168 : 120 - 129
  • [24] Elemental: A New Framework for Distributed Memory Dense Matrix Computations
    Poulson, Jack
    Marker, Bryan
    van de Geijn, Robert A.
    Hammond, Jeff R.
    Romero, Nichols A.
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2013, 39 (02):
  • [25] A framework for dense triangular matrix kernels on various manycore architectures
    Charara, Ali
    Keyes, David
    Ltaief, Hatem
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (15):
  • [26] NUMA-Aware Dense Matrix Factorizations and Inversion with Look-Ahead on Multicore Processors
    Catalan, Sandra
    Igual, Francisco D.
    Rodriguez-Sanchez, Rafael
    Herrero, Jose R.
    Quintana-Orti, Enrique S.
    2022 IEEE 34TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2022), 2022, : 91 - 99
  • [27] A flexible matrix algebra framework for the multimedia multipathway modeling of emission to impacts
    Rosenbaum, Ralph K.
    Margni, Manuele
    Jolliet, Olivier
    ENVIRONMENT INTERNATIONAL, 2007, 33 (05) : 624 - 634
  • [28] Technical perspective for: MATLANG: Matrix operations and their expressive power
    Suciu D.
    SIGMOD Record, 2019, 48 (01): : 59
  • [30] Power-Aware Characteristics of Matrix Operations on Multicores
    Konnurmath, Guruprasad
    Chickerur, Satyadhyan
    APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (15) : 2102 - 2123