A Machine-Learning Algorithm with Disjunctive Model for Data-Driven Program Analysis

被引:18
|
作者
Jeon, Minseok [1 ]
Jeong, Sehun [1 ]
Cha, Sungdeok [1 ]
Oh, Hakjoo [1 ]
机构
[1] Korea Univ, Dept Comp Sci & Engn, 145 Anam Ro, Seoul 02841, South Korea
关键词
Data-driven program analysis; static analysis; context-sensitivity; flow-sensitivity; POINTS-TO ANALYSIS; CONTEXT-SENSITIVITY; STRATEGY; PRECISE; OCTAGON;
D O I
10.1145/3293607
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a new machine-learning algorithm with disjunctive model for data-driven program analysis. One major challenge in static program analysis is a substantial amount of manual effort required for tuning the analysis performance. Recently, data-driven program analysis has emerged to address this challenge by automatically adjusting the analysis based on data through a learning algorithm. Although this new approach has proven promising for various program analysis tasks, its effectiveness has been limited due to simple-minded learning models and algorithms that are unable to capture sophisticated, in particular disjunctive, program properties. To overcome this shortcoming, this article presents a new disjunctive model for data-driven program analysis as well as a learning algorithm to find the model parameters. Our model uses Boolean formulas over atomic features and therefore is able to express nonlinear combinations of program properties. A key technical challenge is to efficiently determine a set of good Boolean formulas, as brute-force search would simply be impractical. We present a stepwise and greedy algorithm that efficiently learns Boolean formulas. We show the effectiveness and generality of our algorithm with two static analyzers: context-sensitive points-to analysis for Java and flow-sensitive interval analysis for C. Experimental results show that our automated technique significantly improves the performance of the state-of-the-art techniques including ones hand-crafted by human experts.
引用
收藏
页数:41
相关论文
共 50 条
  • [41] Data-driven predictive prognostic model for power batteries based on machine learning
    Dong, Jinxi
    Yu, Zhaosheng
    Zhang, Xikui
    Luo, Jiajun
    Zou, Qihong
    Feng, Chao
    Ma, Xiaoqian
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2023, 172 : 894 - 907
  • [42] A data-driven predictive maintenance model for hospital HVAC system with machine learning
    Al-Aomar, Raid
    AlTal, Marah
    Abel, Jochen
    BUILDING RESEARCH AND INFORMATION, 2024, 52 (1-2): : 207 - 224
  • [43] An intelligent data-driven model for disease diagnosis based on machine learning theory
    He Huang
    Wei Gao
    Chunming Ye
    Journal of Combinatorial Optimization, 2021, 42 : 884 - 895
  • [44] Data-driven symbol detection via model-based machine learning
    Farsad, Nariman
    Shlezinger, Nir
    Goldsmith, Andrea J.
    Eldar, Yonina C.
    COMMUNICATIONS IN INFORMATION AND SYSTEMS, 2020, 20 (03) : 283 - 317
  • [45] An intelligent data-driven model for disease diagnosis based on machine learning theory
    Huang, He
    Gao, Wei
    Ye, Chunming
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2021, 42 (04) : 884 - 895
  • [46] D3: Data-Driven Disjunctive Abstraction
    Peleg, Hila
    Shoham, Sharon
    Yahav, Eran
    VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION, VMCAI 2016, 2016, 9583 : 185 - 205
  • [47] Data-driven models in machine learning for crime prediction
    Wawrzyniak, Zbigniew M.
    Jankowski, Stanislaw
    Szczechla, Eliza
    Szymanski, Zbigniew
    Pytlak, Radoslaw
    Michalak, Pawel
    Borowik, Grzegorz
    2018 26TH INTERNATIONAL CONFERENCE ON SYSTEMS ENGINEERING (ICSENG 2018), 2018,
  • [48] Chinese diabetes datasets for data-driven machine learning
    Zhao, Qinpei
    Zhu, Jinhao
    Shen, Xuan
    Lin, Chuwen
    Zhang, Yinjia
    Liang, Yuxiang
    Cao, Baige
    Li, Jiangfeng
    Liu, Xiang
    Rao, Weixiong
    Wang, Congrong
    SCIENTIFIC DATA, 2023, 10 (01)
  • [49] Unsupervised machine learning for data-driven representations of reactions
    Sirumalla, Sai Krishna
    West, Richard
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [50] Anomaly analytics in data-driven machine learning applications
    Azimi, Shelernaz
    Pahl, Claus
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2025, 19 (01) : 155 - 180