Effort-Aware Tri-Training for Semi-supervised Just-in-Time Defect Prediction

被引:5
|
作者
Zhang, Wenzhou [1 ]
Li, Weiwei [2 ]
Jia, Xiuyi [1 ,3 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Astronaut, Nanjing 210016, Peoples R China
[3] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
基金
中国博士后科学基金;
关键词
Defect prediction; Just-in-time; Tri-training; Effort-aware;
D O I
10.1007/978-3-030-16145-3_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, just-in-time (JIT) defect prediction has gained considerable interest as it enables developers to identify risky changes at check-in time. Previous studies tried to conduct research from both supervised and unsupervised perspectives. Since the label of change is hard to acquire, it would be more desirable for applications if a prediction model doesn't highly rely on the label information. However, the performance of the unsupervised models proposed by previous work isn't good in terms of precision and F1 due to the lack of supervised information. To overcome this weakness, we try to study the JIT defect prediction from the semi-supervised perspective, which only requires a few labeled data for training. In this paper, we propose an Effort-Aware Tri-Training (EATT) semi-supervised model for JIT defect prediction based on sample selection. We compare EATT with the state-of-the-art supervised and unsupervised models with respect to different labeled rates. The experimental results on six open-source projects demonstrate that EATT performs better than existing supervised and unsupervised models for effort-aware JIT defect prediction.
引用
收藏
页码:293 / 304
页数:12
相关论文
共 50 条
  • [31] Tri-training and data editing based semi-supervised clustering algorithm
    Deng, Chao
    Guo, Mao-Zu
    Ruan Jian Xue Bao/Journal of Software, 2008, 19 (03): : 663 - 673
  • [32] A Novel Semi-supervised SVM Based on Tri-training for Intrusition Detection
    Li, Jimin
    Zhang, Wei
    Li, KunLun
    JOURNAL OF COMPUTERS, 2010, 5 (04) : 638 - 645
  • [33] Pattern Recognition for Partial Discharge Based on Tri-Training Semi-Supervised Learning
    Yao, Linpeng
    Wang, Hui
    Qian, Yong
    Huang, Chengjun
    Jiang, Xiuchen
    INTERNATIONAL REVIEW OF ELECTRICAL ENGINEERING-IREE, 2010, 5 (06): : 2673 - 2678
  • [34] A semi-supervised soft sensor modeling method based on the Tri-training GPR
    Ma, Junxia
    Li, Lintao
    Xiong, Weili
    Huagong Xuebao/CIESC Journal, 2024, 75 (07): : 2613 - 2623
  • [35] Semi-supervised active learning algorithm for SVMs based on QBC and tri-training
    Hailong Xu
    Longyue Li
    Pengsong Guo
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 8809 - 8822
  • [36] Semi-supervised Classification Model Based on Ladder Network and Improved Tri-training
    Mo J.-W.
    Jia P.
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (08): : 2088 - 2096
  • [37] Semi-supervised active learning algorithm for SVMs based on QBC and tri-training
    Xu, Hailong
    Li, Longyue
    Guo, Pengsong
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (09) : 8809 - 8822
  • [38] Classification of Hyperspectral Data Based on Semi-supervised Tri-training Learning Framework
    Huang, Rui
    Zhou, Lina
    ADVANCED MATERIALS IN MICROWAVES AND OPTICS, 2012, 500 : 374 - 382
  • [39] Just-in-time semi-supervised soft sensor for quality prediction in industrial rubber mixers
    Zheng, Wenjian
    Liu, Yi
    Gao, Zengliang
    Yang, Jianguo
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 180 : 36 - 41
  • [40] Revisiting 'revisiting supervised methods for effort-aware cross-project defect prediction'
    Li, Fuyang
    Yang, Peixin
    Keung, Jacky Wai
    Hu, Wenhua
    Luo, Haoyu
    Yu, Xiao
    IET SOFTWARE, 2023, 17 (04) : 472 - 495