A new machine learning approach to optimize correlated biomarkers

被引:0
|
作者
Lee, Ya-Hsun [1 ]
Chen, Yi-Hau [2 ]
Guo, Chao-Yu [1 ,3 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Inst Publ Hlth, Coll Med, Taipei, Taiwan
[2] Acad Sinica, Inst Stat Sci, Taipei, Taiwan
[3] Natl Yang Ming Chiao Tung Univ, Inst Stat, Hsinchu, Taiwan
关键词
Biomarkers combination; diagnosis accuracy; machine learning; statistical boosting; Youden Index; CLASSIFICATION;
D O I
10.1080/03610926.2025.2477289
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The number of novel biomarkers is booming. However, a simple predictive score is more feasible to evaluate the clinical outcome and provide better accuracy. However, the optimal linear combination of correlated biomarkers demands comprehensive methodological research. This research aims to develop a novel approach for interpretable optimization. This research proposes the gradient boost machine with the Youden Index (GBYI) as the target function. The rationale is that the gradient boost machine demonstrates superior prediction ability and provides excellent interpretations according to the linear model. In addition, the Youden Index could effortlessly estimate the optimal cutoff point of the diagnostic test and evaluate the overall accuracy. Simulation studies evaluate the performance of the GBYI with linear and nonlinear structured datasets. We also demonstrate an application in the Bupa Liver Disease Data, which revealed that our optimal combination of correlated biomarkers shows an improved prediction with higher accuracy. This research proposes a novel machine-learning strategy using the powerful statistical boosting technique of the Youden Index. The new machine could optimize the combination of high-dimensional data and provide attractive interpretable coefficients.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] A Machine Learning Approach for Identifying Gene Biomarkers Guiding the Treatment of Breast Cancer
    Abou Tabl, Ashraf
    Alkhateeb, Abedalrhman
    ElMaraghy, Waguih
    Rueda, Luis
    Ngom, Alioune
    FRONTIERS IN GENETICS, 2019, 10
  • [32] A new approach to optimize a tool-path of a five-axis milling machine
    Makhanov, SS
    Sonthipaumpoon, K
    PROCEEDINGS OF SECOND INTERNATIONAL WORKSHOP ON CSCW IN DESIGN, 1997, : 567 - 574
  • [33] Machine-Learning Approach to Optimize SMOTE Ratio in Class Imbalance Dataset for Intrusion Detection
    Seo, Jae-Hyun
    Kim, Yong-Hyuk
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
  • [34] A New Machine Learning Approach for Actual Calcium Measurement
    Kumari, Suchitra
    Nayak, Saurav
    Mangaraj, Manaswini
    INDIAN JOURNAL OF CLINICAL BIOCHEMISTRY, 2025, 40 (02) : 300 - 306
  • [35] A new machine learning approach to seabed biotope classification
    Cooper, Keith M.
    Barry, Jon
    OCEAN & COASTAL MANAGEMENT, 2020, 198
  • [36] New approach for luminescence sensing based on machine learning
    Venturini, Francesca
    Baumgartner, Michael
    Michelucci, Umberto
    OPTICAL DATA SCIENCE II, 2019, 10937
  • [37] A machine learning (ML) approach for identifying genetic biomarkers and new targets associated with impaired survival of breast cancer patients
    Martin, G. Sanz
    Martelli, V. Doldan
    Izquierdo, J. Del Castillo
    del Campo, P. Gomez
    Galmarini, C. M.
    Correa, J. M. Dominguez
    EUROPEAN JOURNAL OF CANCER, 2022, 175 : S82 - S82
  • [38] Machine Learning and Graph Theory to Optimize Drinking Water
    Amali, Said
    EL Faddouli, Nour-eddine
    Boutoulout, Ali
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2017), 2018, 127 : 310 - 319
  • [39] Use of machine learning to optimize actuator configuration on an airfoil
    Tadjfar, M.
    Kamari, Dj.
    Tarokh, A.
    JOURNAL OF FLUIDS AND STRUCTURES, 2024, 128
  • [40] Machine learning and glioma imaging biomarkers
    Booth, T. C.
    Williams, M.
    Luis, A.
    Cardoso, J.
    Ashkan, K.
    Shuaib, H.
    CLINICAL RADIOLOGY, 2020, 75 (01) : 20 - 32