Semi-supervised learning for software quality estimation

Authors
Seliya, N [1]
Khoshgoftaar, TM [1]
Zhong, S [1]
Affiliations
[1] Florida Atlantic Univ, Dept Comp Sci & Engn, Boca Raton, FL 33431 USA
Keywords
semi-supervised learning; software quality estimation; unlabeled data; software metrics; expectation maximization
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A software quality estimation model is often built using known software metrics and fault data obtained from program modules of previously developed releases or similar projects. Such a supervised learning approach to software quality estimation assumes that fault data is available for all the previously developed modules. Considering the various practical issues in software project development, fault data may not be available for all the software modules in the training data. More specifically, the available labeled training data is such that a supervised learning approach may not yield good software quality prediction. In contrast, a supervised classification scheme aided by unlabeled data, i.e., semi-supervised learning, may yield better results. This paper investigates semi-supervised learning with the Expectation Maximization (EM) algorithm for the software quality classification problem. Case studies of software measurement data obtained from two NASA software projects, JM1 and KC2, are used in our empirical investigation. A small portion of the JM1 dataset is randomly extracted and used as the labeled data, while the remaining JM1 instances are used as unlabeled data. The performance of the semi-supervised classification models built using the EM algorithm is evaluated by using the KC2 project as a test dataset. It is shown that the EM-based semi-supervised learning scheme improves the predictive accuracy of the software quality classification models.
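The EM-based scheme described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that are not in the source: the base classifier is a Gaussian naive Bayes model (the abstract does not name the base learner), synthetic data stands in for the JM1 and KC2 datasets, and every function name (`m_step`, `e_step`, `em_semi_supervised`, `predict`, `make_data`) is hypothetical. It also assumes the labeled set contains at least one module of each class.

```python
import numpy as np

def m_step(X, resp, var_floor=1e-6):
    """Weighted Gaussian naive Bayes estimates from soft class memberships."""
    weight = resp.sum(axis=0)                    # effective count per class
    prior = weight / weight.sum()
    mean = (resp.T @ X) / weight[:, None]
    var = (resp.T @ X**2) / weight[:, None] - mean**2
    return prior, mean, np.maximum(var, var_floor)

def e_step(X, prior, mean, var):
    """Posterior class probabilities for each row of X."""
    diff = X[:, None, :] - mean[None, :, :]
    log_lik = -0.5 * (np.log(2 * np.pi * var)[None] + diff**2 / var[None]).sum(axis=2)
    log_post = log_lik + np.log(prior)[None]
    log_post -= log_post.max(axis=1, keepdims=True)   # numerical stability
    post = np.exp(log_post)
    return post / post.sum(axis=1, keepdims=True)

def em_semi_supervised(X_lab, y_lab, X_unl, n_classes=2, n_iter=20):
    """Fit on labeled data, then refine with soft labels for unlabeled data."""
    resp_lab = np.eye(n_classes)[y_lab]          # labeled rows keep their hard labels
    params = m_step(X_lab, resp_lab)             # initialize from labeled data only
    X_all = np.vstack([X_lab, X_unl])
    for _ in range(n_iter):
        resp_unl = e_step(X_unl, *params)        # E-step: soft labels for unlabeled rows
        params = m_step(X_all, np.vstack([resp_lab, resp_unl]))  # M-step on all rows
    return params

def predict(X, params):
    return e_step(X, *params).argmax(axis=1)

# Synthetic stand-in data (an assumption; not the NASA JM1/KC2 measurements).
rng = np.random.default_rng(0)

def make_data(n):
    y = np.arange(n) % 2                         # alternate the two classes
    X = rng.normal(loc=2.0 * y[:, None], scale=1.0, size=(n, 3))
    return X, y

X_lab, y_lab = make_data(10)      # small labeled portion
X_unl, _ = make_data(500)         # remaining instances, labels discarded
X_test, y_test = make_data(300)   # separate test set

params = em_semi_supervised(X_lab, y_lab, X_unl)
acc = (predict(X_test, params) == y_test).mean()
print(f"test accuracy: {acc:.3f}")
```

In this sketch the labeled modules keep their known class memberships in every iteration, and only the unlabeled modules receive re-estimated soft labels, which mirrors the labeled/unlabeled split the abstract describes.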
Pages: 183-190
Page count: 8