A Hierarchical Feature Ensemble Deep Learning Approach for Software Defect Prediction

被引:2
|
作者
Zhang, Shenggang [1 ]
Jiang, Shujuan [1 ]
Yan, Yue [1 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Peoples R China
基金
中国国家自然科学基金;
关键词
Software defect prediction; deep learning; abstract syntax tree; class dependency network; ensemble learning;
D O I
10.1142/S0218194023500079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software defect prediction can detect modules that may have defects in advance and optimize resource allocation to improve test efficiency and reduce development costs. Traditional features cannot capture deep semantic and grammatical information, which limits the further development of software defect prediction. Therefore, it has gradually become a trend to use deep learning technology to automatically learn valuable deep features from source code or relevant data. However, most software defect prediction methods based on deep learning extraction features from a single information source or only use a single deep learning model, which leads to the fact that the extracted features are not comprehensive enough to affect the final prediction performance. In view of this, this paper proposes a Hierarchical Feature Ensemble Deep Learning (HFEDL) Approach for software defect prediction. Firstly, the HFEDL approach needs to obtain three types of information sources: abstract syntax tree (AST), class dependency network (CDN) and traditional features. Then, the Convolutional Neural Network (CNN) and the Bidirectional Long Short-Term Memory based on Attention mechanism (BiLSTM+Attention) are used to extract different valuable features from the three information sources and multiple prediction sub-models are constructed. Next, all the extracted features are fused by a filter mechanism to obtain more comprehensive features and construct a fusion prediction sub-model. Finally, all the sub-models are integrated by an ensemble learning method to obtain the final prediction model. We use 11 projects in the PROMISE defect repository and evaluate our approach in both non-effort-aware and effort-aware scenarios. The experimental results show that the prediction performance of our approach is superior to state-of-the-art methods in both scenarios.
引用
收藏
页码:543 / 573
页数:31
相关论文
共 50 条
  • [1] Feature Clustering and Ensemble Learning Based Approach for Software Defect Prediction
    Srivastava R.
    Jain A.K.
    Recent Advances in Computer Science and Communications, 2022, 15 (06): : 868 - 882
  • [2] Deep Semantic Feature Learning for Software Defect Prediction
    Wang, Song
    Liu, Taiyue
    Nam, Jaechang
    Tan, Lin
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2020, 46 (12) : 1267 - 1293
  • [3] LDFR: Learning deep feature representation for software defect prediction
    Xu, Zhou
    Li, Shuai
    Xu, Jun
    Liu, Jin
    Luo, Xiapu
    Zhang, Yifeng
    Zhang, Tao
    Keung, Jacky
    Tang, Yutian
    JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 158
  • [4] Software Defect Prediction: A Machine Learning Approach with Voting Ensemble
    Mosquera, Marcela
    Hurtado, Remigio
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 3, 2024, 1013 : 585 - 595
  • [5] An Ensemble Learning Approach for Software Defect Prediction in Developing Quality Software Product
    Saheed, Yakub Kayode
    Longe, Olumide
    Baba, Usman Ahmad
    Rakshit, Sandip
    Vajjhala, Narasimha Rao
    ADVANCES IN COMPUTING AND DATA SCIENCES, PT I, 2021, 1440 : 317 - 326
  • [6] Ensemble learning based software defect prediction
    Dong, Xin
    Liang, Yan
    Miyamoto, Shoichiro
    Yamaguchi, Shingo
    JOURNAL OF ENGINEERING RESEARCH, 2023, 11 (04): : 377 - 391
  • [7] Software Defect Prediction Based Ensemble Approach
    Harikiran J.
    Chandana B.S.
    Srinivasarao B.
    Raviteja B.
    Reddy T.S.
    Computer Systems Science and Engineering, 2023, 45 (03): : 2313 - 2331
  • [8] Enhancing software defect prediction: a framework with improved feature selection and ensemble machine learning
    Ali, Misbah
    Mazhar, Tehseen
    Al-Rasheed, Amal
    Shahzad, Tariq
    Ghadi, Yazeed Yasin
    Khan, Muhammad Amir
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [9] Deep Semantic Feature Learning with Embedded Static Metrics for Software Defect Prediction
    Fan, Guisheng
    Diao, Xuyang
    Yu, Huiqun
    Yang, Kang
    Chen, Liqiong
    2019 26TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC), 2019, : 244 - 251
  • [10] Feature Selection and Software Defect Prediction by Different Ensemble Classifiers
    Shakhovska, Natalya
    Yakovyna, Vitaliy
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT I, 2021, 12923 : 307 - 313