Improving memory system performance for multimedia applications

Cited by: 1
Authors
Youn, Jonghee M. [1]
Cho, Doosan [2]
Affiliations
[1] Yeungnam Univ, Dept Comp Engn, Gyongsan, South Korea
[2] Sunchon Natl Univ, Dept Elect & Elect Engn, Chungnam, Suncheon, South Korea
Funding
National Research Foundation of Singapore
Keywords
Memory system; Energy consumption; Compiler optimization; Restructuring; Burst mode
DOI
10.1007/s11042-015-2807-y
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The cost and performance of embedded systems depend heavily on the performance of the memories they use. Memory access latency is one of the major bottlenecks in system performance. In software compilation, memory access latency is known to vary widely depending on how the variables in the code are stored to and retrieved from memory. Reducing this latency requires a technique that maximizes the use of memory bandwidth. Burst transfer is a well-known technique for doing so: it reduces average access time by more than 65% for an eight-word sequential transfer. However, the problem of utilizing such burst transfers has not been generally addressed, and unfortunately, it is not tractable. In this work, we present a new technique that identifies sequences of single load and store instructions that can be combined into burst transfers. The proposed technique also provides an optimal data placement of nonarray variables to achieve the maximum utilization of burst data transfers. The major contributions of our work are that 1) we prove that the problem is NP-hard and 2) we propose an exact formulation of the problem and an efficient data placement algorithm. In experiments with a set of multimedia benchmarks, we confirm that our proposed technique uses on average 7 times more burst accesses than code generated by a commercial ARM compiler.
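The link between data placement and burst transfers can be illustrated with a small sketch. It is not the authors' formulation or algorithm. It assumes a simplified model in which every scalar variable occupies one word, and a run of consecutive accesses can be merged into one burst only if each access targets the address immediately after the previous one, up to eight words. The names `count_transfers` and `best_placement` and the sample access sequence are hypothetical.

```python
from itertools import permutations


def count_transfers(accesses, placement, max_burst=8):
    """Return (memory transfers, bursts) for one placement.

    Assumed model: one word per variable; consecutive accesses merge
    into a burst only when their addresses ascend by exactly one word.
    """
    addr = {var: i for i, var in enumerate(placement)}
    transfers = 0
    bursts = 0
    i = 0
    while i < len(accesses):
        j = i + 1
        # Extend the run while the next access hits the next address.
        while (j < len(accesses) and j - i < max_burst
               and addr[accesses[j]] == addr[accesses[j - 1]] + 1):
            j += 1
        transfers += 1
        if j - i > 1:
            bursts += 1
        i = j
    return transfers, bursts


def best_placement(accesses):
    """Exhaustive search over all placements (illustration only).

    The factorial cost of this search is why a brute-force approach
    does not scale beyond a handful of variables.
    """
    variables = sorted(set(accesses))
    return min(permutations(variables),
               key=lambda p: count_transfers(accesses, p)[0])


if __name__ == "__main__":
    seq = ["a", "c", "b", "a", "c", "d"]  # hypothetical load sequence
    naive = ("a", "b", "c", "d")          # declaration-order placement
    print(naive, count_transfers(seq, naive))
    best = best_placement(seq)
    print(best, count_transfers(seq, best))
```

Under this model, the declaration-order placement needs five transfers for the sample sequence, while the placement found by the search needs two, both of them bursts. The search itself is exhaustive, so it only serves to show the objective being optimized.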
Pages: 5951-5963
Page count: 13