Efficient Synonym Filtering and Scalable Delayed Translation for Hybrid Virtual Caching

被引:5
|
作者
Park, Chang Hyun [1 ]
Heo, Taekyung [1 ]
Huh, Jaehyuk [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon, South Korea
关键词
address translation; hybrid virtual cache; synonym detection; segmented translation;
D O I
10.1109/ISCA.2016.28
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Conventional translation look-aside buffers (TLBs) are required to complete address translation with short latencies, as the address translation is on the critical path of all memory accesses even for L1 cache hits. Such strict TLB latency restrictions limit the TLB capacity, as the latency increase with large TLBs may lower the overall performance even with potential TLB miss reductions. Furthermore, TLBs consume a significant amount of energy as they are accessed for every instruction fetch and data access. To avoid the latency restriction and reduce the energy consumption, virtual caching techniques have been proposed to defer translation to after L1 cache misses. However, an efficient solution for the synonym problem has been a critical issue hindering the wide adoption of virtual caching. Based on the virtual caching concept, this study proposes a hybrid virtual memory architecture extending virtual caching to the entire cache hierarchy, aiming to improve both performance and energy consumption. The hybrid virtual caching uses virtual addresses augmented with address space identifiers (ASID) in the cache hierarchy for common non-synonym addresses. For such non-synonyms, the address translation occurs only after last-level cache (LLC) misses. For uncommon synonym addresses, the addresses are translated to physical addresses with conventional TLBs before L1 cache accesses. To support such hybrid translation, we propose an efficient synonym detection mechanism based on Bloom filters which can identify synonym candidates with few false positives. For large memory applications, delayed translation alone cannot solve the address translation problem, as fixed-granularity delayed TLBs may not scale with the increasing memory requirements. To mitigate the translation scalability problem, this study proposes a delayed many segment translation designed for the hybrid virtual caching. The experimental results show that our approach effectively lowers accesses to the TLBs, leading to significant power savings. In addition, the approach provides performance improvement with scalable delayed translation with variable length segments.
引用
收藏
页码:217 / 229
页数:13
相关论文
共 50 条
  • [21] Efficient Scalable Median Filtering Using Histogram-Based Operations
    Green, Oded
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (05) : 2217 - 2228
  • [22] Scalable and Efficient Neural Speech Coding: A Hybrid Design
    Zhen, Kai
    Sung, Jongmo
    Lee, Mi Suk
    Beack, Seungkwon
    Kim, Minje
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 12 - 25
  • [23] Power-Efficient and Scalable Virtual Router Architecture on FPGA
    Haria, Swapnil
    Ganegedara, Thilan
    Prasanna, Viktor
    2012 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2012,
  • [24] Memory-Efficient and Scalable Virtual Routers Using FPGA
    Le, Hoang
    Ganegedara, Thilan
    Prasanna, Viktor K.
    FPGA 11: PROCEEDINGS OF THE 2011 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD PROGRAMMABLE GATE ARRAYS, 2011, : 257 - 266
  • [25] Efficient image restoration of virtual machines with reference count based rewriting and caching
    Xu, Jiwei
    Wang, Tao
    Xing, Xiaozhao
    Zhang, Wenbo
    Zhong, Hua
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 77 : 87 - 96
  • [26] An Efficient Hybrid I/O Caching Architecture Using Heterogeneous SSDs
    Salkhordeh, Reza
    Hadizadeh, Mostafa
    Asadi, Hossein
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (06) : 1238 - 1250
  • [27] Utopia: Fast and Efficient Address Translation via Hybrid Restrictive & Flexible Virtual-to-Physical Address Mappings
    Kanellopoulos, Konstantinos
    Bera, Rahul
    Stojiljkovic, Kosta
    Bostanci, Nisa
    Firtina, Can
    Ausavarungnirun, Rachata
    Kumar, Rakesh
    Hajinazar, Nastaran
    Sadrosadati, Mohammad
    Vijaykumar, Nandita
    Mutlu, Onur
    56TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2023, 2023, : 1196 - 1212
  • [28] Efficient Page Caching Algorithm with Prediction and Migration for a Hybrid Main Memory
    Seok, Hyunchul
    Park, Youngwoo
    Park, Ki-Woong
    Park, Kyu Ho
    APPLIED COMPUTING REVIEW, 2011, 11 (04): : 38 - 48
  • [29] A hybrid clustering based filtering approach with efficient sequencing
    Mittal, Namita
    Govil, M. C.
    Nayak, Richi
    Kumar, Rajesh
    Gothwal, Himanshu
    Das, Dwipayan
    IMECS 2008: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2008, : 517 - +
  • [30] HyperFuzzer: An Efficient Hybrid Fuzzer for Virtual CPUs
    Ge, Xinyang
    Niu, Ben
    Brotzman, Robert
    Chen, Yaohui
    Han, HyungSeok
    Godefroid, Patrice
    Cui, Weidong
    CCS '21: PROCEEDINGS OF THE 2021 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2021, : 366 - 378