CaPPS: cache partitioning with partial sharing for multi-core embedded systems

被引:0
|
作者
Wei Zang
Ann Gordon-Ross
机构
[1] SK Hynix Memory Solution,Department of Electrical and Computer Engineering
[2] University of Florida,undefined
来源
关键词
Cache memories; Modeling techniques; Optimization; Performance evaluation;
D O I
暂无
中图分类号
学科分类号
摘要
As the number of cores in chip multi-processor systems increases, the contention over shared last-level cache (LLC) resources increases, thus making LLC optimization critical, especially for embedded systems with strict area/energy/power constraints. We propose cache partitioning with partial sharing (CaPPS), which reduces LLC contention using cache partitioning and improves utilization with sharing configuration. Sharing configuration enables the partitions to be privately allocated to a single core, partially shared with a subset of cores, or fully shared with all cores based on the co-executing applications’ requirements. CaPPS imposes low hardware overhead and affords an extensive design space to increase optimization potential. To facilitate fast design space exploration, we develop an analytical model to quickly estimate the miss rates of all CaPPS configurations using the applications’ isolated LLC access traces to predict runtime LLC contention. Experimental results demonstrate that the analytical model estimates cache miss rates with an average error of only 0.73 % and with an average speedup of 3505×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3505\times $$\end{document} as compared to a cycle-accurate simulator. Due to CaPPS’s extensive design space, CaPPS can reduce the average LLC miss rate by as much as 25 % as compared to baseline configurations and as much as 14–17 % as compared to prior works.
引用
收藏
页码:65 / 92
页数:27
相关论文
共 50 条
  • [41] Embedded Multi-Core Systems Dedicated to Dynamic Dataflow Programs
    Yviquel, Herve
    Sanchez, Alexandre
    Jaaskelainen, Pekka
    Takala, Jarmo
    Raulet, Mickael
    Casseau, Emmanuel
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2015, 80 (01): : 121 - 136
  • [42] Insights on memory controller scaling in multi-core embedded systems
    Marino, Mario Donato
    Li, Kuan-Ching
    INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2014, 6 (04) : 351 - 361
  • [43] Design Issues in a Performance Monitor for Embedded Multi-core Systems
    Chao-Sheng Lin
    Chih-Sheng Lin
    Pao-Ann Hsiung
    William C. Chu
    Journal of Signal Processing Systems, 2014, 75 : 141 - 154
  • [44] Portable CSP Based Design for Embedded Multi-Core Systems
    Sputh, Bernhard H. C.
    Faust, Oliver
    Allen, Alastair R.
    COMMUNICATING PROCESS ARCHITECTURES 2006: WOTUG-29, 2006, 64 : 123 - 134
  • [45] Directory cache design for multi-core processor
    State Key Laboratory of High-End Server & Storage Technology , Beijing
    100085, China
    Jisuanji Yanjiu yu Fazhan, 6 (1242-1253):
  • [46] Multi-Core HTB for Bandwidth Sharing
    Jia, Chengjun
    Fu, Zhe
    Hu, Xiaohe
    Cao, Shui
    Wang, Liang
    Li, Jun
    PROCEEDINGS OF THE 2018 SYMPOSIUM ON ARCHITECTURES FOR NETWORKING AND COMMUNICATIONS SYSTEMS (ANCS '18), 2018, : 169 - 171
  • [47] Writeback-Aware Bandwidth Partitioning for Multi-core Systems with PCM
    Zhou, Miao
    Du, Yu
    Childers, Bruce R.
    Melhem, Rami
    Mosse, Daniel
    2013 22ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2013, : 113 - 122
  • [48] A Data-sharing Aware and Scalable Cache Miss Rates Model for Multi-core Processors with Multi-level Cache Hierarchies
    Wang, Guangmin
    Ge, Jiancong
    Yan, Yunhao
    Ling, Ming
    2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 267 - 274
  • [49] Analytical Modeling the Multi-Core Shared Cache Behavior With Considerations of Data-Sharing and Coherence
    Ling, Ming
    Lu, Xiaoqian
    Wang, Guangmin
    Ge, Jiancong
    IEEE ACCESS, 2021, 9 : 17728 - 17743
  • [50] Impact of Cache Partitioning on Multi-Tasking Real Time Embedded Systems
    Bui, Bach D.
    Caccamo, Marco
    Sha, Lui
    Martinez, Joseph
    RTCSA 2008: 14TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS - PROCEEDINGS, 2008, : 101 - +