Low-power architecture with scratch-pad memory for accelerating embedded applications with run-time reuse

被引:1
|
作者
Milidonis, A. [1 ]
Porpodas, V. [1 ]
Alachiotis, N. [1 ]
Kakarountas, A. P. [1 ]
Michail, H. [1 ]
Panagiotakopoulos, G. [1 ]
Goutis, C. E. [1 ]
机构
[1] Univ Patras, Dept Elect & Comp Engn, VLSI Design Lab, Patras, Greece
来源
关键词
D O I
10.1049/iet-cdt:20070145
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Current embedded systems are usually designed for data-dominated applications, but they have a tight energy and time budget. Scratch-pad memories are completely software-controlled memories with predictable behaviour and good performance and energy characteristics, thus they tend to become a standard feature in many embedded systems. However, their predictability is not helping if the application accesses its data dynamically, when the addresses of the accessed data depend on the application's input. In such cases, predetermining the scratch-pad content at design-time is not always possible as the compiler cannot predict the runtime input. Moreover, in this case, both data reuse and data placement in the scratch-pad are inefficient because chunks of data already stored cannot be efficently reused and combined with the runtime accessed data blocks. State-of-the art techniques copy each new data block to the scratch-pad without considering whether portions of them are already in it. Such dynamic temporal locality cannot be predicted or exploited by the compiler. The authors here present a system architecture, strongly connected to the system's scratch-pad and the processor's compiler, which is able to efficiently exploit run-time data reuse in the scratch-pad by being capable of holding valuable information, such as the exact data contents of the scratch-pad at runtime, and using it to do all the necessary operations for placing each new data block in scratch-pad. It is. ne tuned for applications with run-time reuse between rectangular data blocks. The application domain of the proposed architecture is multimedia applications with run-time reuse, certain applications with linked lists and multi-threaded applications. It operates in a time and energy-efficient manner when compared with existing scratch-pad architectures without the authors' scratch-pad accelerator engine, showing its higher normalised performance and lower normalised energy consumption. Experimental results show up to 2.5 times performance increase compared with existing scratch-pad architectures and 5 times compared with cache architectures and energy decrease up to 1.9 and 3.9 times, respectively.
引用
收藏
页码:109 / 123
页数:15
相关论文
共 42 条
  • [21] Design Space Exploration for Low-Power Memory Systems in Embedded Signal Processing Applications
    Balasa, Florin
    Gingu, Cristian V.
    Luican, Ilie I.
    Zhu, Hongwei
    2013 IEEE 19TH INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA), 2013, : 92 - 100
  • [22] Memory design and exploration for low-power embedded applications: A case study on hash algorithms
    Bhattacharya, Debojyoti
    Saha, Avishek
    ADCOM 2007: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, 2007, : 479 - 484
  • [23] Low-power 2D motion estimation architecture with complementary embedded memory banks
    Kim, KS
    Lee, K
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2000, 10 (5-6) : 229 - 237
  • [24] Memory allocation for low-power real-time embedded microcontroller: a case study
    Zhang, Zhishen
    Shen, Yuwen
    Sun, Binqi
    Kloda, Tomasz
    Caccamo, Marco
    2022 IEEE 27TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2022,
  • [25] Real-time memory efficient SLIC accelerator for low-power applications
    Khamaneh, Paria Ansar
    Khakpour, Ali
    Shoaran, Maryam
    Karimian, Ghader
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 32449 - 32467
  • [26] Nano-mechanical cantilever arrays for low-power and low-voltage embedded nonvolatile memory applications
    Smith, Charles G.
    van Kampen, Rob
    Popp, Jens
    Lacy, Damian
    Pinchetti, Don
    Renault, Michael
    Joshi, Vikram
    Beunder, Mike A.
    MEMS/MOEMS COMPONENTS AND THEIR APPLICATIONS IV, 2007, 6464
  • [27] High-performance and low-power memory-interface architecture for video processing applications
    Kim, H
    Park, IC
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2001, 11 (11) : 1160 - 1170
  • [28] An novel low power embedded memory architecture for MPEG-4 applications with mobile devices
    Sayed, M
    Badawy, W
    SYSTEM-ON-CHIP FOR REAL-TIME APPLICATIONS, 2003, : 262 - 272
  • [29] Data-reuse-driven energy-aware cosynthesis of scratch pad memory and hierarchical bus-based communication architecture for multiprocessor streaming applications
    Issenin, Ilya
    Brockmeyer, Erik
    Durinck, Bart
    Dutt, Nikil D.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2008, 27 (08) : 1439 - 1452
  • [30] A new single-poly flash memory cell with low-voltage and low-power operations for embedded applications
    Chi, MH
    Bergemont, A
    55TH ANNUAL DEVICE RESEARCH CONFERENCE, DIGEST - 1997, 1997, : 126 - 127