Transactional Prefetching: Narrowing the Window of Contention in Hardware Transactional Memory

被引:0
|
作者
Negi, Anurag [1 ]
Armejach, Adria [2 ,3 ]
Cristal, Adrian [2 ,4 ]
Unsal, Osman S. [2 ]
Stenstrom, Per [1 ]
机构
[1] Chalmers Univ Technol, Gothenburg, Sweden
[2] Barcelona Supercomp Ctr, Barcelona, Spain
[3] Univ Politecn Cataluna, Barcelona, Spain
[4] Spanish Natl Res Council, CSIC, IIIA, Barcelona, Spain
关键词
hardware transactional memory; multicores; prefetching;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Memory access latency is the primary performance bottle-neck in modern computer systems. Prefetching data before it is needed by a processing core allows substantial performance gains by overlapping significant portions of memory latency with useful work. Prior work has investigated this technique and measured potential benefits in a variety of scenarios. However, its use in speeding up Hardware Transactional Memory (HTM) has remained hitherto unexplored. In several HTM designs transactions invalidate speculatively updated cache lines when they abort. Such cache lines tend to have high locality and are likely to be accessed again when the transaction re-executes. Coarse grained transactions that update several cache lines are particularly susceptible to performance degradation even under moderate contention. However, such transactions show strong locality of reference, especially when contention is high. Prefetching cache lines with high locality can, therefore, improve overall concurrency by speeding up transactions and, thereby, narrowing the window of time in which such transactions persist and can cause contention. Such transactions are important since they are likely to form a common TM use-case. We note that traditional prefetch techniques may not be able to track such lines adequately or issue prefetches quickly enough. This paper investigates the use of prefetching in HTMs, proposing a simple design to identify and request prefetch candidates, and measures performance gains to be had for several representative TM workloads.
引用
收藏
页码:181 / 190
页数:10
相关论文
共 50 条
  • [21] Performance pathologies in hardware transactional memory
    Bobba, Jayaram
    Moore, Kevin E.
    Volos, Haris
    Yen, Luke
    Hill, Mark D.
    Swift, Michael M.
    Wood, David A.
    IEEE MICRO, 2008, 28 (01) : 32 - 41
  • [22] Hardware Transactional Memory for GPU Architectures
    Fung, Wilson W. L.
    Singh, Inderpreet
    Brownsword, Andrew
    Aamodt, Tor M.
    PROCEEDINGS OF THE 2011 44TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 44), 2011, : 296 - 307
  • [23] DHTM: Durable Hardware Transactional Memory
    Joshi, Arpit
    Nagarajan, Vijay
    Cintra, Marcelo
    Viglas, Stratis
    2018 ACM/IEEE 45TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2018, : 452 - 465
  • [24] Hardware Transactional Memory meets Memory Persistency
    Castro, Daniel
    Romano, Paolo
    Barreto, Joao
    2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018, : 368 - 377
  • [25] Performance Pathologies in Hardware Transactional Memory
    Bobba, Jayaram
    Moore, Kevin E.
    Volos, Haris
    Yen, Luke
    Hill, Mark D.
    Swift, Michael M.
    Wood, David A.
    ISCA'07: 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, CONFERENCE PROCEEDINGS, 2007, : 81 - 91
  • [26] An Analytical Model of Hardware Transactional Memory
    Castro, Daniel
    Romano, Paolo
    Didona, Diego
    Zwaenepoel, Willy
    2017 IEEE 25TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), 2017, : 221 - 231
  • [27] Integrating Caching and Prefetching Mechanisms in a Distributed Transactional Memory
    Dash, Alokika
    Demsky, Brian
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (08) : 1284 - 1298
  • [28] Transactional Event Profiling in a Best-Effort Hardware Transactional Memory System
    Gaudet, Matthew
    Amaral, Jose Nelson
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'12), 2012, : 475 - 475
  • [29] Mileage-based Contention Management in Transactional Memory
    Choi, Woojin
    Zhao, Lihang
    Draper, Jeff
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'12), 2012, : 471 - 471
  • [30] Providing QoS in Contention Management for Software Transactional Memory
    Fahmy, Sherif F.
    Senousy, Zakaria
    Amin, Ahmed F.
    2017 13TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2017, : 231 - 236