Optimal linear ensemble of binary classifiers

被引:1
|
作者
Ahsen, Mehmet Eren [1 ,2 ,5 ]
Vogel, Robert [3 ,4 ]
Stolovitzky, Gustavo [3 ]
机构
[1] Univ Illinois, Dept Business Adm, 1206 S Sixth St, Champaign, IL 61820 USA
[2] Univ Illinois, Dept Biomed & Translat Sci, Urbana, IL 61801 USA
[3] IBM Corp, Thomas J Watson Res Ctr, New York, NY 10598 USA
[4] Scripps Res, Dept Integrated Struct & Computat Biol, La Jolla, CA 92037 USA
[5] Univ Illinois, Dept Biomed & Translat Sci, 1206 S Sixth St, Champaign, IL 61820 USA
来源
BIOINFORMATICS ADVANCES | 2024年 / 4卷 / 01期
关键词
PREDICTION; CHALLENGE; AREA; CARE;
D O I
10.1093/bioadv/vbae093
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Motivation The integration of vast, complex biological data with computational models offers profound insights and predictive accuracy. Yet, such models face challenges: poor generalization and limited labeled data.Results To overcome these difficulties in binary classification tasks, we developed the Method for Optimal Classification by Aggregation (MOCA) algorithm, which addresses the problem of generalization by virtue of being an ensemble learning method and can be used in problems with limited or no labeled data. We developed both an unsupervised (uMOCA) and a supervised (sMOCA) variant of MOCA. For uMOCA, we show how to infer the MOCA weights in an unsupervised way, which are optimal under the assumption of class-conditioned independent classifier predictions. When it is possible to use labels, sMOCA uses empirically computed MOCA weights. We demonstrate the performance of uMOCA and sMOCA using simulated data as well as actual data previously used in Dialogue on Reverse Engineering and Methods (DREAM) challenges. We also propose an application of sMOCA for transfer learning where we use pre-trained computational models from a domain where labeled data are abundant and apply them to a different domain with less abundant labeled data.Availability and implementation GitHub repository, https://github.com/robert-vogel/moca.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Ensemble analysis on syndrome entropy of binary linear codes
    Wadayama, Tadashi
    2006 IEEE International Symposium on Information Theory, Vols 1-6, Proceedings, 2006, : 1559 - 1563
  • [32] On the Interpretation of Ensemble Classifiers in Terms of Bayes Classifiers
    Tri Le
    Bertrand Clarke
    Journal of Classification, 2018, 35 : 198 - 229
  • [33] On the Interpretation of Ensemble Classifiers in Terms of Bayes Classifiers
    Le, Tri
    Clarke, Bertrand
    JOURNAL OF CLASSIFICATION, 2018, 35 (02) : 198 - 229
  • [34] An optimal intrusion detection system using recursive feature elimination and ensemble of classifiers
    Sharma, Neha, V
    Yadav, Narendra Singh
    MICROPROCESSORS AND MICROSYSTEMS, 2021, 85
  • [35] Deep Ensemble of Classifiers for Alzheimer's Disease Detection with Optimal Feature Set
    Rajasree, R. S.
    Rajakumari, S. Brintha
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2023,
  • [36] An evolutionary algorithm approach to optimal ensemble classifiers for DNA microarray data analysis
    Kim, Kyung-Joong
    Cho, Sung-Bae
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2008, 12 (03) : 377 - 388
  • [37] Ensemble of HOSVD Generated Tensor Subspace Classifiers with Optimal Tensor Flattening Directions
    Cyganek, Boguslaw
    Wozniak, Michal
    Jankowski, Dariusz
    Hybrid Artificial Intelligent Systems, 2016, 9648 : 560 - 571
  • [38] Optimal selection of ensemble classifiers using particle swarm optimization and diversity measures
    Hasanpour, Hesam
    Meibodi, Ramak Ghavamizadeh
    Navi, Keivan
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2019, 13 (01): : 131 - 137
  • [39] Ensemble of binary SVM classifiers based on PCA and LDA feature extraction for intrusion detection
    Aburomman, Abdulla Amin
    Reaz, Mamun Bin Ibne
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 636 - 640
  • [40] Optimal linear combination using decision reliability of individual classifiers
    Lu, Z
    Ding, XQ
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 44 - 47