Low-power architecture with scratch-pad memory for accelerating embedded applications with run-time reuse

Cited by: 1
Authors
Milidonis, A. [1]
Porpodas, V. [1]
Alachiotis, N. [1]
Kakarountas, A. P. [1]
Michail, H. [1]
Panagiotakopoulos, G. [1]
Goutis, C. E. [1]
Affiliation
[1] Univ Patras, Dept Elect & Comp Engn, VLSI Design Lab, Patras, Greece
DOI
10.1049/iet-cdt:20070145
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Current embedded systems are usually designed for data-dominated applications, but they have a tight energy and time budget. Scratch-pad memories are completely software-controlled memories with predictable behaviour and good performance and energy characteristics, so they tend to become a standard feature in many embedded systems. However, their predictability does not help if the application accesses its data dynamically, that is, when the addresses of the accessed data depend on the application's input. In such cases, predetermining the scratch-pad content at design time is not always possible, as the compiler cannot predict the run-time input. Moreover, in this case, both data reuse and data placement in the scratch-pad are inefficient because chunks of data already stored cannot be efficiently reused and combined with the data blocks accessed at run time. State-of-the-art techniques copy each new data block to the scratch-pad without considering whether portions of it are already there. Such dynamic temporal locality cannot be predicted or exploited by the compiler. The authors here present a system architecture, strongly connected to the system's scratch-pad and the processor's compiler, which is able to efficiently exploit run-time data reuse in the scratch-pad. It does so by holding valuable information, such as the exact data contents of the scratch-pad at run time, and using it to perform all the operations necessary for placing each new data block in the scratch-pad. It is fine-tuned for applications with run-time reuse between rectangular data blocks. The application domain of the proposed architecture is multimedia applications with run-time reuse, certain applications with linked lists, and multi-threaded applications. It operates in a time- and energy-efficient manner when compared with existing scratch-pad architectures without the authors' scratch-pad accelerator engine, showing higher normalised performance and lower normalised energy consumption.
Experimental results show up to 2.5 times performance increase compared with existing scratch-pad architectures and 5 times compared with cache architectures and energy decrease up to 1.9 and 3.9 times, respectively.
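
Illustrative note (not part of the published abstract): the idea of run-time reuse between rectangular data blocks can be sketched in a few lines of Python. This is only a software illustration under stated assumptions, not the authors' hardware engine; the names `Block`, `overlap`, and `missing_parts` are hypothetical, and the sketch assumes a single resident block and axis-aligned rectangles given as half-open row and column ranges. It computes which parts of a newly requested block are not yet in the scratch-pad, so that only those parts would need to be copied.

```python
from typing import List, NamedTuple, Optional


class Block(NamedTuple):
    """Axis-aligned rectangle of elements: rows [r0, r1), columns [c0, c1)."""
    r0: int
    c0: int
    r1: int
    c1: int

    def area(self) -> int:
        return max(0, self.r1 - self.r0) * max(0, self.c1 - self.c0)


def overlap(a: Block, b: Block) -> Optional[Block]:
    """Return the rectangle shared by a and b, or None if they are disjoint."""
    r0, c0 = max(a.r0, b.r0), max(a.c0, b.c0)
    r1, c1 = min(a.r1, b.r1), min(a.c1, b.c1)
    if r0 >= r1 or c0 >= c1:
        return None
    return Block(r0, c0, r1, c1)


def missing_parts(resident: Block, new: Block) -> List[Block]:
    """Split `new` into disjoint rectangles that `resident` does not cover.

    Assumption: a naive scheme would copy all of `new`; here only the
    returned rectangles would have to be fetched from main memory.
    """
    ov = overlap(resident, new)
    if ov is None:
        return [new]  # nothing reusable, the whole block must be copied
    parts = []
    if new.r0 < ov.r0:  # full-width strip above the overlap
        parts.append(Block(new.r0, new.c0, ov.r0, new.c1))
    if ov.r1 < new.r1:  # full-width strip below the overlap
        parts.append(Block(ov.r1, new.c0, new.r1, new.c1))
    if new.c0 < ov.c0:  # strip to the left, limited to the overlap rows
        parts.append(Block(ov.r0, new.c0, ov.r1, ov.c0))
    if ov.c1 < new.c1:  # strip to the right, limited to the overlap rows
        parts.append(Block(ov.r0, ov.c1, ov.r1, new.c1))
    return parts


if __name__ == "__main__":
    resident = Block(0, 0, 16, 16)   # block already held in the scratch-pad
    requested = Block(4, 4, 20, 20)  # block whose position depends on input
    to_copy = missing_parts(resident, requested)
    copied = sum(p.area() for p in to_copy)
    print(f"copy {copied} of {requested.area()} elements: {to_copy}")
```

In this example the two 16 x 16 blocks share a 12 x 12 region, so only the elements outside that region need to be transferred. How the published architecture tracks scratch-pad contents and places data is not specified in the abstract and is not modelled here.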
Pages: 109-123
Page count: 15