Semi-supervised learning for software quality estimation

被引:0
|
作者
Seliya, N [1 ]
Khoshgoftaar, TM [1 ]
Zhong, S [1 ]
机构
[1] Florida Atlantic Univ, Dept Comp Sci & Engn, Boca Raton, FL 33431 USA
关键词
semi-supervised learning; software quality estimation; unlabeled data; software metrics; expectation maximization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A software quality estimation model is often built using known software metrics and fault data obtained from program modules of previously developed releases or similar projects. Such a supervised learning approach to software quality estimation assumes that fault data is available for all the previously developed modules. Considering the various practical issues in software project development, fault data may not be available for all the software modules in the training data. More specifically, the available labeled training data is such that a supervised learning approach may not yield good software quality prediction. In contrast, a supervised classification scheme aided by unlabeled data, i.e., semi-supervised learning, may yield better results. This paper investigates semi-supervised learning with the Expectation Maximization (EM) algorithm for the software quality classification problem. Case studies of software measurement data obtained from two NASA software projects, JM1 and KC2, are used in our empirical investigation. A small portion of the JM1 dataset is randomly extracted and used as the labeled data, while the remaining JM1 instances are used as unlabeled data. The performance of the semi-supervised classification models built using the EM algorithm is evaluated by using the KC2 project as a test dataset. It is shown that the EM-based semi-supervised learning scheme improves the predictive accuracy of the software quality classification models.
引用
收藏
页码:183 / 190
页数:8
相关论文
共 50 条
  • [41] Semi-supervised distribution learning
    Wen, Mengtao
    Jia, Yinxu
    Ren, Haojie
    Wang, Zhaojun
    Zou, Changliang
    BIOMETRIKA, 2024, 112 (01)
  • [42] Universal Semi-Supervised Learning
    Huang, Zhuo
    Xue, Chao
    Han, Bo
    Yang, Jian
    Gong, Chen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [43] A semi-supervised learning framework for decision modeling of software project management
    Sadabadi, Ali Tizkar (al_tz2@yahoo.com), 1600, Springer Verlag (465):
  • [44] Label propagation based semi-supervised learning for software defect prediction
    Zhang, Zhi-Wu
    Jing, Xiao-Yuan
    Wang, Tie-Jian
    AUTOMATED SOFTWARE ENGINEERING, 2017, 24 (01) : 47 - 69
  • [45] Software Defect Prediction Using Semi-supervised Learning with Dimension Reduction
    Lu, Huihua
    Cukic, Bojan
    Culp, Mark
    2012 PROCEEDINGS OF THE 27TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2012, : 314 - 317
  • [46] Semi-supervised machine learning approach for unknown malicious software detection
    Bisio, Federica
    Gastaldo, Paolo
    Zunino, Rodolfo
    Decherchi, Sergio
    2014 IEEE INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA 2014), 2014, : 52 - 59
  • [47] A Semi-supervised Learning Framework for Decision Modeling of Software Project Management
    Sadabadi, Ali Tizkar
    INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2014, 2014, 465 : 134 - 149
  • [48] Label propagation based semi-supervised learning for software defect prediction
    Zhi-Wu Zhang
    Xiao-Yuan Jing
    Tie-Jian Wang
    Automated Software Engineering, 2017, 24 : 47 - 69
  • [49] Adversarial Dropout for Supervised and Semi-Supervised Learning
    Park, Sungrae
    Park, JunKeon
    Shin, Su-Jin
    Moon, Il-Chul
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3917 - 3924
  • [50] Supervised and semi-supervised machine learning ranking
    Vittaut, Jean-Noel
    Gallinari, Patrick
    COMPARATIVE EVALUATION OF XML INFORMATION RETRIEVAL SYSTEMS, 2007, 4518 : 213 - 222