CaPPS: cache partitioning with partial sharing for multi-core embedded systems

被引:0
|
作者
Wei Zang
Ann Gordon-Ross
机构
[1] SK Hynix Memory Solution,Department of Electrical and Computer Engineering
[2] University of Florida,undefined
来源
关键词
Cache memories; Modeling techniques; Optimization; Performance evaluation;
D O I
暂无
中图分类号
学科分类号
摘要
As the number of cores in chip multi-processor systems increases, the contention over shared last-level cache (LLC) resources increases, thus making LLC optimization critical, especially for embedded systems with strict area/energy/power constraints. We propose cache partitioning with partial sharing (CaPPS), which reduces LLC contention using cache partitioning and improves utilization with sharing configuration. Sharing configuration enables the partitions to be privately allocated to a single core, partially shared with a subset of cores, or fully shared with all cores based on the co-executing applications’ requirements. CaPPS imposes low hardware overhead and affords an extensive design space to increase optimization potential. To facilitate fast design space exploration, we develop an analytical model to quickly estimate the miss rates of all CaPPS configurations using the applications’ isolated LLC access traces to predict runtime LLC contention. Experimental results demonstrate that the analytical model estimates cache miss rates with an average error of only 0.73 % and with an average speedup of 3505×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3505\times $$\end{document} as compared to a cycle-accurate simulator. Due to CaPPS’s extensive design space, CaPPS can reduce the average LLC miss rate by as much as 25 % as compared to baseline configurations and as much as 14–17 % as compared to prior works.
引用
收藏
页码:65 / 92
页数:27
相关论文
共 50 条
  • [31] Predictable Cache Coherence for Multi-Core Real-Time Systems
    Hassan, Mohamed
    Kaushik, Anirudh M.
    Patel, Hiren
    PROCEEDINGS OF THE 23RD IEEE REAL-TIME AND EMBEDDED TECHNOLOGY AND APPLICATIONS SYMPOSIUM (RTAS 2017), 2017, : 235 - 246
  • [32] Cache Interference-aware Task Partitioning for Non-preemptive Real-time Multi-core Systems
    Xiao, Jun
    Shen, Yixian
    Pimentel, Andy D.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (03)
  • [33] Cache Efficiency and Scalability on Multi-core Architectures
    Mueller, Thomas
    Trinitis, Carsten
    Smajic, Jasmin
    PARALLEL COMPUTING TECHNOLOGIES, 2011, 6873 : 88 - +
  • [34] A Cache Utility Monitor for Multi-core Processor
    Fang, Juan
    Cheng, Yan-Jin
    Cai, Min
    Chang, Ze-Qing
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND SENSOR NETWORKS (WCSN 2016), 2016, 44 : 561 - 565
  • [35] Developing Predictable Vehicular Distributed Embedded Systems on Multi-core
    Mubeen, Saad
    Nolte, Thomas
    Lundback, Kurt-Lennart
    INFORMATION TECHNOLOGY: NEW GENERATIONS, 2016, 448 : 1273 - 1277
  • [36] MESS: Memory Performance Debugging on Embedded Multi-core Systems
    Chattopadhyay, Sudipta
    MODEL CHECKING SOFTWARE, SPIN 2015, 2015, 9232 : 105 - 125
  • [37] FIDES: A multi-core platform to enhance robustness of embedded systems
    Inoue, Hiroaki
    Sato, Naoki
    NEC TECHNICAL JOURNAL, 2006, 1 (03): : 51 - 54
  • [38] Off-Chip Memory Bandwidth Minimization through Cache Partitioning for Multi-Core Platforms
    Yu, Chenjie
    Petrov, Peter
    PROCEEDINGS OF THE 47TH DESIGN AUTOMATION CONFERENCE, 2010, : 132 - 137
  • [39] Embedded Multi-Core Systems Dedicated to Dynamic Dataflow Programs
    Hervé Yviquel
    Alexandre Sanchez
    Pekka Jääskeläinen
    Jarmo Takala
    Mickaël Raulet
    Emmanuel Casseau
    Journal of Signal Processing Systems, 2015, 80 : 121 - 136
  • [40] Design Issues in a Performance Monitor for Embedded Multi-core Systems
    Lin, Chao-Sheng
    Lin, Chih-Sheng
    Hsiung, Pao-Ann
    Chu, William C.
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 75 (02): : 141 - 154