Classification of mislabelled microarrays using robust sparse logistic regression

Cited by: 31
Authors
Bootkrajang, Jakramate [1]
Kaban, Ata [1]
Affiliations
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
Keywords
DISCRIMINANT-ANALYSIS; INITIAL SAMPLES; GENE SELECTION; CANCER;
DOI
10.1093/bioinformatics/btt078
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Previous studies have reported that labelling errors are not uncommon in microarray datasets. In such cases, the training set may become misleading, and the ability of classifiers to make reliable inferences from the data is compromised. Yet few methods are currently available in the bioinformatics literature to deal with this problem. The few existing methods focus on data cleansing alone, without reference to classification, and their performance depends crucially on tuning parameters.

Results: In this article, we develop a new method to detect mislabelled arrays simultaneously with learning a sparse logistic regression classifier. Our method may be seen as a label-noise robust extension of the well-known and successful Bayesian logistic regression classifier. To account for possible mislabelling, we formulate a label-flipping process as part of the classifier. The regularization parameter is set automatically using Bayesian regularization, which not only saves the computation time that cross-validation would take, but also eliminates any unwanted effects of label noise when setting the regularization parameter. Extensive experiments with both synthetic data and real microarray datasets demonstrate that our approach counters the adverse effects of labelling errors on predictive performance, is effective at identifying marker genes, and simultaneously detects mislabelled arrays with high accuracy.
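The idea of building a label-flipping process into a logistic regression classifier can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the flip probabilities `g01` (true 0 observed as 1) and `g10` (true 1 observed as 0), and the fixed L1 weight `lam` are assumptions made for illustration. In particular, the fixed `lam` stands in for the Bayesian regularization described in the abstract, which this sketch does not reproduce.

```python
import numpy as np


def sigmoid(z):
    """Standard logistic function."""
    return 1.0 / (1.0 + np.exp(-z))


def noisy_label_prob(X, w, g01, g10):
    """P(observed label = 1 | x) under an assumed label-flipping process.

    g01 = P(observed 1 | true 0), g10 = P(observed 0 | true 1).
    Both are hypothetical parameters of this sketch.
    """
    p = sigmoid(X @ w)  # P(true label = 1 | x)
    return (1.0 - g10) * p + g01 * (1.0 - p)


def robust_nll(X, y_obs, w, g01, g10, lam=0.0):
    """Negative log-likelihood of the observed (possibly flipped) labels,
    plus a fixed L1 penalty as a simple stand-in for a sparsity prior."""
    q = np.clip(noisy_label_prob(X, w, g01, g10), 1e-12, 1.0 - 1e-12)
    nll = -np.sum(y_obs * np.log(q) + (1 - y_obs) * np.log(1.0 - q))
    return nll + lam * np.sum(np.abs(w))


def flip_posterior(X, y_obs, w, g01, g10):
    """Posterior probability that each observed label differs from the
    true label, obtained with Bayes' rule from the flipping model."""
    p = sigmoid(X @ w)
    q1 = (1.0 - g10) * p + g01 * (1.0 - p)  # P(observed 1 | x)
    q0 = g10 * p + (1.0 - g01) * (1.0 - p)  # P(observed 0 | x)
    flipped_if_obs1 = g01 * (1.0 - p) / q1  # true 0, observed 1
    flipped_if_obs0 = g10 * p / q0          # true 1, observed 0
    return np.where(y_obs == 1, flipped_if_obs1, flipped_if_obs0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(6, 3))
    w = np.array([1.5, -2.0, 0.0])
    y_obs = np.array([1, 0, 1, 1, 0, 0])
    print(robust_nll(X, y_obs, w, g01=0.1, g10=0.1, lam=0.5))
    # Arrays with a flip posterior above 0.5 would be flagged as suspect.
    print(flip_posterior(X, y_obs, w, g01=0.1, g10=0.1) > 0.5)
```

Setting both flip probabilities to zero recovers ordinary logistic regression. In a full method the weights and flip probabilities would be estimated jointly from the data rather than supplied by hand as they are here.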
Pages: 870-877
Page count: 8