Effort-Aware Tri-Training for Semi-supervised Just-in-Time Defect Prediction

被引:5
|
作者
Zhang, Wenzhou [1 ]
Li, Weiwei [2 ]
Jia, Xiuyi [1 ,3 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Astronaut, Nanjing 210016, Peoples R China
[3] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
基金
中国博士后科学基金;
关键词
Defect prediction; Just-in-time; Tri-training; Effort-aware;
D O I
10.1007/978-3-030-16145-3_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, just-in-time (JIT) defect prediction has gained considerable interest as it enables developers to identify risky changes at check-in time. Previous studies tried to conduct research from both supervised and unsupervised perspectives. Since the label of change is hard to acquire, it would be more desirable for applications if a prediction model doesn't highly rely on the label information. However, the performance of the unsupervised models proposed by previous work isn't good in terms of precision and F1 due to the lack of supervised information. To overcome this weakness, we try to study the JIT defect prediction from the semi-supervised perspective, which only requires a few labeled data for training. In this paper, we propose an Effort-Aware Tri-Training (EATT) semi-supervised model for JIT defect prediction based on sample selection. We compare EATT with the state-of-the-art supervised and unsupervised models with respect to different labeled rates. The experimental results on six open-source projects demonstrate that EATT performs better than existing supervised and unsupervised models for effort-aware JIT defect prediction.
引用
收藏
页码:293 / 304
页数:12
相关论文
共 50 条
  • [41] Revisiting Supervised and Unsupervised Methods for Effort-Aware Cross-Project Defect Prediction
    Ni, Chao
    Xia, Xin
    Lo, David
    Chen, Xiang
    Gu, Qing
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (03) : 786 - 802
  • [42] Effort-Aware Just-in-Time Bug Prediction for Mobile Apps Via Cross-Triplet Deep Feature Embedding
    Xu, Zhou
    Zhao, Kunsong
    Zhang, Tao
    Fu, Chunlei
    Yan, Meng
    Xie, Zhiwen
    Zhang, Xiaohong
    Catolino, Gemma
    IEEE TRANSACTIONS ON RELIABILITY, 2022, 71 (01) : 204 - 220
  • [43] Semi-supervised patent text classification method based on improved Tri-training algorithm
    Hu Y.-Q.
    Qiu Q.-Y.
    Yu X.
    Wu J.-W.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (02): : 331 - 339
  • [44] Semi-supervised active learning image classification method based on Tri-Training algorithm
    Zhang, Yongjun
    Yan, Siyu
    PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020, : 206 - 210
  • [45] Semi-Supervised Network Traffic Classification Based on Feature Selection and Improved Tri-training
    Li, Daoquan
    Zhu, Shengkai
    Zhai, Yuyang
    Hu, Yifan
    Computer Engineering and Applications, 2024, 60 (23) : 275 - 285
  • [46] Ternary reversible extreme learning machines: the incremental tri-training method for semi-supervised classification
    Xiao-Liang Tang
    Min Han
    Knowledge and Information Systems, 2010, 23 : 345 - 372
  • [47] A Novel Tri-Training Technique for Semi-Supervised Classification of Hyperspectral Images Based on Diversity Measurement
    Tan, Kun
    Zhu, Jishuai
    Du, Qian
    Wu, Lixin
    Du, Peijun
    REMOTE SENSING, 2016, 8 (09)
  • [48] Learning to rank software modules for effort-aware defect prediction
    Rao, Jiqing
    Yu, Xiao
    Zhang, Chen
    Zhou, Junwei
    Xiang, Jianwen
    2021 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C 2021), 2021, : 372 - 380
  • [49] Semi-Supervised PolSAR Image Classification Based on Improved Tri-Training With a Minimum Spanning Tree
    Wang, Shuang
    Guo, Yanhe
    Hua, Wenqiang
    Liu, Xinan
    Song, Guoxin
    Hou, Biao
    Jiao, Licheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (12): : 8583 - 8597
  • [50] Ternary reversible extreme learning machines: the incremental tri-training method for semi-supervised classification
    Tang, Xiao-Liang
    Han, Min
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 23 (03) : 345 - 372