ParaLog: Enabling and Accelerating Online Parallel Monitoring of Multithreaded Applications

被引:2
|
作者
Vlachos, Evangelos [1 ]
Goodstein, Michelle L. [1 ]
Kozuch, Michael A. [2 ]
Chen, Shimin [2 ]
Falsafi, Babak [3 ]
Gibbons, Phillip B. [2 ]
Mowry, Todd C. [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Intel Labs Pittsburgh, Pittsburgh, PA USA
[3] Ecole Polytech Fed Lausanne, CH-1015 Lausanne, Switzerland
关键词
Design; Performance; Reliability; Security; Online Parallel Monitoring; Hardware Support for Debugging; Instruction-grain Lifeguards;
D O I
10.1145/1735971.1736051
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Instruction-grain lifeguards monitor the events of a running application at the level of individual instructions in order to identify and help mitigate application bugs and security exploits. Because such lifeguards impose a 10-100X slowdown on existing platforms, previous studies have proposed hardware designs to accelerate lifeguard processing. However, these accelerators are either tailored to a specific class of lifeguards or suitable only for monitoring single-threaded programs. We present ParaLog, the first design of a system enabling fast online parallel monitoring of multithreaded parallel applications. ParaLog supports a broad class of software-defined lifeguards. We show how three existing accelerators can be enhanced to support online multithreaded monitoring, dramatically reducing lifeguard overheads. We identify and solve several challenges in monitoring parallel applications and/or parallelizing these accelerators, including (i) enforcing inter-thread data dependences, (ii) dealing with inter-thread effects that are not reflected in coherence traffic, (iii) dealing with unmonitored operating system activity, and (iv) ensuring lifeguards can access shared metadata with negligible synchronization overheads. We present our system design for both Sequentially Consistent and Total Store Ordering processors. We implement and evaluate our design on a 16 core simulated CMP, using benchmarks from SPLASH-2 and PARSEC and two lifeguards: a data-flow tracking lifeguard and a memory-access checker lifeguard. Our results show that (i) our parallel accelerators improve performance by 2-9X and 1.13-3.4X for our two lifeguards, respectively, (ii) we are 5-126X faster than the time-slicing approach required by existing techniques, and (iii) our average overheads for applications with eight threads are 51% and 28% for the two lifeguards, respectively.
引用
收藏
页码:271 / 283
页数:13
相关论文
共 50 条
  • [31] BeeSwarm: Enabling Parallel Scaling Performance Measurement in Continuous Integration for HPC Applications
    Tronge, Jake
    Chen, Jieyang
    Grubel, Patricia
    Randles, Tim
    Davis, Rusty
    Wofford, Quincy
    Anaya, Steven
    Guan, Qiang
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021, : 1136 - 1140
  • [32] Designing parallel data processing for enabling large-scale sensor applications
    Milan Kabáč
    Charles Consel
    Nic Volanschi
    Personal and Ubiquitous Computing, 2017, 21 : 457 - 473
  • [33] A Review of Research Aspects of Redundantly Actuated Parallel Robotsw for Enabling Further Applications
    Mostashiri, Naser
    Dhupia, Jaspreet S.
    Verl, Alexander W.
    Xu, Weiliang
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2018, 23 (03) : 1259 - 1269
  • [34] Designing parallel data processing for enabling large-scale sensor applications
    Kabac, Milan
    Consel, Charles
    Volanschi, Nic
    PERSONAL AND UBIQUITOUS COMPUTING, 2017, 21 (03) : 457 - 473
  • [35] Online scheduling of parallel programs on heterogeneous systems with applications to Cilk
    Bender, MA
    Rabin, MO
    THEORY OF COMPUTING SYSTEMS, 2002, 35 (03) : 289 - 304
  • [36] Online root-cause performance analysis of parallel applications
    Sikora, Anna
    Margalef, Tomas
    Jorba, Josep
    PARALLEL COMPUTING, 2015, 48 : 81 - 107
  • [37] Online perfonnance observation of large-scale parallel applications
    Malony, AD
    Shende, S
    Bell, R
    PARALLEL COMPUTING: SOFTWARE TECHNOLOGY, ALGORITHMS, ARCHITECTURES AND APPLICATIONS, 2004, 13 : 761 - 768
  • [38] Distributed parallel algorithms for online virtual network embedding applications
    Lu, Qiao
    Nguyen, Khoa
    Huang, Changcheng
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2023, 36 (01)
  • [39] Online Scheduling of Parallel Programs on Heterogeneous Systems with Applications to Cilk
    Michael A. Bender
    Michael O. Rabin
    Theory of Computing Systems, 2002, 35 : 289 - 304
  • [40] Modelbased online monitoring of batch processes for practical applications
    Christmann, U
    Hanisch, HM
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 726 - 731