Power aeware framework for dense matrix operations in multimedia processors

被引:0
|
作者
Azeemi, N. Zafar [1 ]
机构
[1] Vienna Univ Technol, Inst Commun & Radio Frequency Engn, Christian Doppler Lab Design Methodol Signal Proc, A-1040 Vienna, Austria
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we analyze(1) the use of Decision Tree Grafting, Blocking and Loop Unfolding to improve the performance of dense matrix computations on high performance multimedia processors. The analysis focuses on the practical aspects that can be observed with multilayered memory levels. The problem is studies on the Philips Nexperia processor. The experimental evaluation of the proposed approach results into better exploitation of functional units, memory hierarchy and highway usage of the target processor. The advantages of the proposed interactive code transformation approach are two-folds. First, effort in optimization is spent only when the program measurement (transformation cost) determines that the effort is necessary and potentially beneficial, and only on those portions of the program where the energy/cycle performance payoff appears to be high. Second, by concatenating subsequent energy/cycle profile-driven low level transformations for higher level manipulations, the system will provide the programmer with a powerful toolset. The approach is illustrated using functional unit usage within a VLIW architecture for low power, which improves energy dissipation up to 34% and CPU performance up to 87% for an idct example.
引用
收藏
页码:718 / 723
页数:6
相关论文
共 50 条
  • [31] Dense Dynamic Blocks: Optimizing SpMM for Processors with Vector and Matrix Units Using Machine Learning Techniques
    Yesil, Serif
    Moreira, Jose E.
    Torrellas, Josep
    PROCEEDINGS OF THE 36TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ICS 2022, 2022,
  • [32] Green Novel Power Control Framework for Dense Femtocell Grids
    Al Haddad, Mazen
    Bayoumi, Magdy
    INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE ANALYSIS APPLICATIONS, 2015,
  • [33] Auto-tuning Dense Vector and Matrix-Vector Operations for Fermi GPUs
    Sorensen, Hans Henrik Brandenborg
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, PT I, 2012, 7203 : 619 - 629
  • [34] Reliability Assessment in Power Distribution Networks by Logical and Matrix Operations
    Midence, D.
    Rivera, S.
    Vargas, A.
    2008 IEEE/PES TRANSMISSION AND DISTRIBUTION CONFERENCE AND EXPOSITION: LATIN AMERICA, VOLS 1 AND 2, 2008, : 53 - 58
  • [35] Geometric tiling for reducing power consumption in structured matrix operations
    Chen, G.
    Xue, L.
    Kim, J.
    Sobti, K.
    Deng, L.
    Sun, X.
    Pitsianis, N.
    Chakrabarti, C.
    Kandemir, M.
    Vijaykrishnan, N.
    IEEE INTERNATIONAL SOC CONFERENCE, PROCEEDINGS, 2006, : 113 - +
  • [36] A backlight power management framework for battery-operated multimedia systems
    Shim, H
    Chang, NY
    Pedram, M
    IEEE DESIGN & TEST OF COMPUTERS, 2004, 21 (05): : 388 - 396
  • [37] A middleware framework coordinating Processor/Power Resource Management for multimedia applications
    Yuan, WH
    Nahrstedt, K
    GLOBECOM '01: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-6, 2001, : 1984 - 1988
  • [38] Multi-threaded dense linear algebra libraries for low-power asymmetric multicore processors
    Catalan, Sandra
    Herrero, Jose R.
    Igual, Francisco D.
    Rodriguez-Sanchez, Rafael
    Quintana-Orti, Enrique S.
    Adeniyi-Jones, Chris
    JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 25 : 140 - 151
  • [39] GPU Acceleration of Dense Matrix and Block Operations for Lanczos Method for Systems Over GF(2)
    Zamarashkin, N. L.
    Zheltkov, D. A.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2019, 40 (11) : 1881 - 1891
  • [40] GPU Acceleration of Dense Matrix and Block Operations for Lanczos Method for Systems Over GF(2)
    N. L. Zamarashkin
    D. A. Zheltkov
    Lobachevskii Journal of Mathematics, 2019, 40 : 1881 - 1891