Fast and Accurate Code Placement of Embedded Software for Hybrid On-chip Memory Architecture

被引:0
|
作者
Zhou, Zimeng [1 ]
Ju, Lei [1 ]
Jia, Zhiping [1 ]
Li, Xin [1 ]
机构
[1] Shandong Univ, Sch Comp Sci & Technol, Jinan, Shandong, Peoples R China
关键词
SCRATCH-PAD MEMORY; ALLOCATION; ALGORITHMS; SYSTEMS;
D O I
10.1109/HPCC.2014.169
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Onchip SRAMs including scratchpad memories (SPMs) and caches are widely used in embedded systems to narrow the speed gap between CPU and memory. Memory subsystem acts as both performance and energy bottleneck for many applications in many contemporary embedded systems. While many off-the-shelf embedded processors employ the architecture with hybrid caches and SPMs on-chip memories, many existing work on SPM management ignore the synergy between caches and SPMs. In this work, we propose static SPM allocation strategy for the above-mentioned system architecture with the objective of minimizing the overall instruction memory subsystem latency and/or energy consumption. We capture the cache conflict misses with a fine-grained temporal cache behavior model. We propose an approximate knapsack based heuristic algorithm to generate an outstanding function-level SPM allocation which favors fast design space exploration. Compared with the state-of-the-art SPM heuristic allocation strategy, experimental results show that our SPM management scheme achieves 14.00% further improvement in instruction memory subsystem performance, and up to 17.69% in terms of energy saving.
引用
收藏
页码:1008 / 1015
页数:8
相关论文
共 50 条
  • [1] Fast Shared On-Chip Memory Architecture for Efficient Hybrid Computing with CGRAs
    Lee, Jongeun
    Jeong, Yeonghun
    Seo, Sungsok
    DESIGN, AUTOMATION & TEST IN EUROPE, 2013, : 1575 - 1578
  • [2] Optimizing code allocation for hybrid on-chip memory in IoT systems
    Sun, Zhe
    Zhou, Zimeng
    Fu, Fang-Wei
    INTEGRATION-THE VLSI JOURNAL, 2024, 97
  • [3] A fast on-chip profiler memory
    Lysecky, R
    Cotterell, S
    Vahid, F
    39TH DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2002, 2002, : 28 - 33
  • [4] On-Chip Memory Architecture Exploration Framework for DSP Processor-Based Embedded System on Chip
    Kumar, T. S. Rajesh
    Govindarajan, R.
    Ravikumar, C. P.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2012, 11 (01)
  • [5] An analyzable on-chip network architecture for embedded systems
    Luedtke, Daniel
    Tutsch, Dietmar
    Hommel, Guenter
    EMBEDDED SYSTEMS - MODELING, TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2006, : 63 - +
  • [6] Unified On-chip Memory Allocation for SIMT Architecture
    Hayes, Ari B.
    Zhang, Eddy Z.
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, (ICS'14), 2014, : 293 - 302
  • [7] Optimal code placement of embedded software for instruction caches
    Tomiyama, H
    Yasuura, H
    EUROPEAN DESIGN & TEST CONFERENCE 1996 - ED&TC 96, PROCEEDINGS, 1996, : 96 - 101
  • [8] Automatic Data Placement into GPU On-Chip Memory Resources
    Li, Chao
    Yang, Yi
    Lin, Zhen
    Zhou, Huiyang
    2015 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2015, : 23 - 33
  • [9] A new accurate and fast arithmetic for on-chip spiral inductors
    Luo, Tianxing
    Shi, Yanling
    Tang, Shenqun
    Ding, Yanfang
    Liu, Yun
    Wang, Yong
    Zhu, Jun
    Chen, Shoumian
    Zhao, Yuhang
    2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2006, : 545 - 549
  • [10] On-chip learning for a scalable hybrid neural architecture
    Alhalabi, BA
    Bayoumi, MA
    ISCAS '97 - PROCEEDINGS OF 1997 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I - IV: CIRCUITS AND SYSTEMS IN THE INFORMATION AGE, 1997, : 677 - 680