A Machine-Learning Algorithm with Disjunctive Model for Data-Driven Program Analysis

被引:18
|
作者
Jeon, Minseok [1 ]
Jeong, Sehun [1 ]
Cha, Sungdeok [1 ]
Oh, Hakjoo [1 ]
机构
[1] Korea Univ, Dept Comp Sci & Engn, 145 Anam Ro, Seoul 02841, South Korea
关键词
Data-driven program analysis; static analysis; context-sensitivity; flow-sensitivity; POINTS-TO ANALYSIS; CONTEXT-SENSITIVITY; STRATEGY; PRECISE; OCTAGON;
D O I
10.1145/3293607
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a new machine-learning algorithm with disjunctive model for data-driven program analysis. One major challenge in static program analysis is a substantial amount of manual effort required for tuning the analysis performance. Recently, data-driven program analysis has emerged to address this challenge by automatically adjusting the analysis based on data through a learning algorithm. Although this new approach has proven promising for various program analysis tasks, its effectiveness has been limited due to simple-minded learning models and algorithms that are unable to capture sophisticated, in particular disjunctive, program properties. To overcome this shortcoming, this article presents a new disjunctive model for data-driven program analysis as well as a learning algorithm to find the model parameters. Our model uses Boolean formulas over atomic features and therefore is able to express nonlinear combinations of program properties. A key technical challenge is to efficiently determine a set of good Boolean formulas, as brute-force search would simply be impractical. We present a stepwise and greedy algorithm that efficiently learns Boolean formulas. We show the effectiveness and generality of our algorithm with two static analyzers: context-sensitive points-to analysis for Java and flow-sensitive interval analysis for C. Experimental results show that our automated technique significantly improves the performance of the state-of-the-art techniques including ones hand-crafted by human experts.
引用
收藏
页数:41
相关论文
共 50 条
  • [21] Dirty engineering data-driven inverse prediction machine learning model
    Jin-Woong Lee
    Woon Bae Park
    Byung Do Lee
    Seonghwan Kim
    Nam Hoon Goo
    Kee-Sun Sohn
    Scientific Reports, 10
  • [22] Dirty engineering data-driven inverse prediction machine learning model
    Lee, Jin-Woong
    Park, Woon Bae
    Lee, Byung Do
    Kim, Seonghwan
    Goo, Nam Hoon
    Sohn, Kee-Sun
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [23] Data-Driven Machine Learning Model for Aircraft Icing Severity Evaluation
    Li, Sibo
    Qin, Jingkun
    Paoli, Roberto
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2021, 18 (11): : 876 - 880
  • [24] A Data-Driven Emotion Model for English Learners Based on Machine Learning
    Zheng, Zhao
    Na, Kew Si
    INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2021, 16 (08) : 34 - 46
  • [25] Data-driven decarbonization framework with machine learning
    Jain, Ayush
    Padmanaban, Manikandan
    Hazra, Jagabondhu
    Guruprasad, Ranjini
    Godbole, Shantanu
    Syam, Heriansyah
    ENVIRONMENTAL DATA SCIENCE, 2024, 3
  • [26] Data-Driven Suitability Analysis to Enable Machine Learning Explainability and Security
    Wolf, Shaya
    Foster, Rita
    Haile, Jed
    Borowczak, Mike
    2021 RESILIENCE WEEK (RWS), 2021,
  • [27] Sensitivity Analysis of the Composite Data-Driven Pipelines in the Automated Machine Learning
    Barabanova, Irina, V
    Vychuzhanin, Pavel
    Nikitin, Nikolay O.
    10TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE (YSC2021), 2021, 193 : 484 - 493
  • [28] Data-driven approaches to hard-to-treat tuberculosis disease: a machine-learning based model for automated recommendation of individualized treatment
    Verboven, Lennert
    Van Rie, Annelies
    Tu, Trang
    INTERNATIONAL JOURNAL OF INFECTIOUS DISEASES, 2025, 152
  • [29] A Machine-Learning Based Approach for Data-Driven Identification of Heating Dynamics of Buildings' Living-Spaces
    Broujeny, Roozbeh Sadeghian
    Madani, Kurosh
    Chebira, Abdennasser
    Amarger, Veronique
    Hurtard, Laurent
    PROCEEDINGS OF THE 2019 10TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS - TECHNOLOGY AND APPLICATIONS (IDAACS), VOL. 1, 2019, : 197 - 202
  • [30] Data-Driven and Machine-Learning Methods to Project Coronavirus Disease 2019 Pandemic Trend in Eastern Mediterranean
    Huang, Wenbo
    Ao, Shuang
    Han, Dan
    Liu, Yuming
    Liu, Shuang
    Huang, Yaojiang
    FRONTIERS IN PUBLIC HEALTH, 2021, 9