Power aeware framework for dense matrix operations in multimedia processors

被引:0
|
作者
Azeemi, N. Zafar [1 ]
机构
[1] Vienna Univ Technol, Inst Commun & Radio Frequency Engn, Christian Doppler Lab Design Methodol Signal Proc, A-1040 Vienna, Austria
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we analyze(1) the use of Decision Tree Grafting, Blocking and Loop Unfolding to improve the performance of dense matrix computations on high performance multimedia processors. The analysis focuses on the practical aspects that can be observed with multilayered memory levels. The problem is studies on the Philips Nexperia processor. The experimental evaluation of the proposed approach results into better exploitation of functional units, memory hierarchy and highway usage of the target processor. The advantages of the proposed interactive code transformation approach are two-folds. First, effort in optimization is spent only when the program measurement (transformation cost) determines that the effort is necessary and potentially beneficial, and only on those portions of the program where the energy/cycle performance payoff appears to be high. Second, by concatenating subsequent energy/cycle profile-driven low level transformations for higher level manipulations, the system will provide the programmer with a powerful toolset. The approach is illustrated using functional unit usage within a VLIW architecture for low power, which improves energy dissipation up to 34% and CPU performance up to 87% for an idct example.
引用
收藏
页码:718 / 723
页数:6
相关论文
共 50 条
  • [1] Modeling power and energy consumption of dense matrix factorizations on multicore processors
    Alonso, Pedro
    Dolz, Manuel F.
    Mayo, Rafael
    Quintana-Orti, Enrique S.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (17): : 2743 - 2757
  • [2] Performance and Power Consumption Investigation for Execution of Integer Operations on CPU and GPU Processors for Multimedia Applications
    Iovanovici, A.
    Visan, C.
    Marcu, M.
    2009 7TH INTERNATIONAL SYMPOSIUM ON INTELLIGENT SYSTEMS AND INFORMATICS, 2009, : 258 - 262
  • [3] Scheduling dense linear algebra operations on multicore processors
    Kurzak, Jakub
    Ltaief, Hatem
    Dongarra, Jack
    Badia, Rosa M.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2010, 22 (01): : 15 - 44
  • [4] Sparse matrix operations in vector and parallel processors
    Doallo, R
    Tourino, J
    Hermo, FM
    APPLICATIONS OF HIGH PERFORMANCE COMPUTING IN ENGINEERING V, 1997, 3 : 43 - 52
  • [5] MATRIX AND VECTOR OPERATIONS ON HYPERCUBE PARALLEL PROCESSORS
    MCBRYAN, OA
    VANDEVELDE, EF
    PARALLEL COMPUTING, 1987, 5 (1-2) : 117 - 125
  • [6] Synthesis of systolic processors for computing matrix operations
    Acad of Sciences of Belarus, Minsk, Belarus
    Eng Simul, 5 (739-747):
  • [7] Processors pack power for wireless multimedia mobility
    Nelson, R
    EDN, 1997, 42 (11) : S12 - &
  • [8] Low-power architectures for programmable multimedia processors
    Nishitani, T
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1999, E82A (02) : 184 - 196
  • [9] Power-aware scoreboard alternatives for multimedia processors
    Baniasadi, Amirali
    Salamat, Babak
    Deris, Kaveh Jokar
    MICROPROCESSORS AND MICROSYSTEMS, 2009, 33 (04) : 326 - 332
  • [10] DENSE MATRIX OPERATIONS ON A TORUS AND A BOOLEAN CUBE
    JOHNSSON, SL
    AFIPS CONFERENCE PROCEEDINGS, 1985, 54 : 307 - &