Enhancing metastatic colorectal cancer prediction through advanced feature selection and machine learning techniques

被引:1
|
作者
Yang, Hui [1 ,2 ]
Liu, Jun [3 ]
Yang, Na [4 ,5 ]
Fu, Qingsheng [3 ]
Wang, Yingying [6 ]
Ye, Mingquan [7 ]
Tao, Shaoneng [6 ]
Liu, Xiaocen [6 ]
Li, Qingqing [7 ]
机构
[1] Yijishan Hosp, Affiliated Hosp 1, Wannan Med Coll, Cent Lab, Wuhu, Anhui, Peoples R China
[2] Anhui Prov Key Lab Noncoding RNA Basic & Clin Tran, Wuhu, Anhui, Peoples R China
[3] Yijishan Hosp, Affiliated Hosp 1, Wannan Med Coll, Dept Gastrointestinal Surg, Wuhu, Anhui, Peoples R China
[4] Yijishan Hosp, Affiliated Hosp 1, Wannan Med Coll, Dept Crit Care Med, Wuhu, Anhui, Peoples R China
[5] Clin Res Ctr Crit Resp Med Anhui Prov, Wuhu, Anhui, Peoples R China
[6] Yijishan Hosp, Affiliated Hosp 1, Wannan Med Coll, Dept Nucl Med, Wuhu 241001, Anhui, Peoples R China
[7] Wannan Med Coll, Res Ctr Hlth Big Data Min & Applicat, Sch Med Informat, Wuhu, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Colorectal cancer; Metastasis prediction; Feature selection; Machine learning; EXPRESSION;
D O I
10.1016/j.intimp.2024.113033
中图分类号
R392 [医学免疫学]; Q939.91 [免疫学];
学科分类号
100102 ;
摘要
Background and aims: Colorectal cancer (CRC) is the third most prevalent cancer globally, posing a significant challenge due to its high rate of metastasis. Approximately 20% of patients with CRC present with distant metastases at diagnosis, and over 50% develop metastases within five years. Accurate prediction of metastasis is crucial for improving survival outcomes in patients with CRC. Methods: This study introduces an innovative cost-sensitive fast correlation-based filter (CS-FCBF) algorithm for feature selection, integrated with machine learning techniques to predict metastatic CRC. The CS-FCBF algorithm effectively reduced the number of genomic features from 184 to 9 critical genes: CXCL9, C2CD4B, RGCC, GFI1, BEX2, CXCL3, FOXQ1, PBK, and PLAG1. The methodology combined in vitro, in vivo, and analysis of publicly available single-cell RNA-seq datasets to validate the findings. Results: The application of the CS-FCBF algorithm led to a significant improvement in prediction model performance, with an average 21.16% increase in the area under the precision-recall curve. The nine identified genes hold potential as diagnostic biomarkers and therapeutic targets for metastatic CRC. Conclusions: This study highlights the critical role of advanced feature selection methods, combined with machine learning, in addressing the challenge of class imbalance in medical diagnosis, particularly for CRC. Early detection of metastasis is vital, and the identified genes underscore their importance in the metastatic process of CRC. The methodology applied here offers valuable insights and paves the way for future research in other cancers or diseases that face similar diagnostic challenges.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Enhancing brain cancer type prediction through machine learning algorithms and feature selection techniques
    Gu, Cong
    Ren, Siyv
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2024, 57 (42)
  • [2] Enhancing Regional Wind Power Forecasting through Advanced Machine-Learning and Feature-Selection Techniques
    Taheri, Nabi
    Tucci, Mauro
    ENERGIES, 2024, 17 (21)
  • [3] Enhancing a machine learning model for predicting agricultural drought through feature selection techniques
    Nikdad, Pardis
    Ghaleni, Mehdi Mohammadi
    Moghaddasi, Mahnoosh
    Pradhan, Biswajeet
    APPLIED WATER SCIENCE, 2024, 14 (06)
  • [4] Prediction of Cardiovascular Disease by Feature Selection and Machine Learning Techniques
    Ranade, Aditya
    Pise, Nitin
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 2, AITA 2023, 2024, 844 : 457 - 472
  • [5] Assessing Feature Selection Techniques for a Colorectal Cancer Prediction Model
    Cueto-Lopez, Nahum
    Alaiz-Rodriguez, Rocio
    Teresa Garcia-Ordas, Maria
    Gonzalez-Donquiles, Carmen
    Martin, Vicente
    INTERNATIONAL JOINT CONFERENCE SOCO'17- CISIS'17-ICEUTE'17 PROCEEDINGS, 2018, 649 : 471 - 481
  • [6] Enhancing Software Requirements Classification with Machine Learning and Feature Selection Techniques
    Lanfear, Daniel
    Maleki, Mina
    Banitaan, Shadi
    SOFTWARE AND DATA ENGINEERING, SEDE 2024, 2025, 2244 : 14 - 30
  • [7] Obsolescence Prediction based on Joint Feature Selection and Machine Learning Techniques
    Trabelsi, Imen
    Zeddini, Besma
    Zolghadri, Marc
    Barkallah, Maher
    Haddar, Mohamed
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 787 - 794
  • [8] Enhancing Software Cost Estimation Using Feature Selection and Machine Learning Techniques
    Mansoor, Fizza
    Alim, Muhammad Affan
    Jilani, Muhammad Taha
    Alam, Muhammad Monsoor
    Su’ud, Mazliham Mohd
    Computers, Materials and Continua, 2024, 81 (03): : 4603 - 4624
  • [9] Solar Flare Prediction Using Advanced Feature Extraction, Machine Learning, and Feature Selection
    Omar W. Ahmed
    Rami Qahwaji
    Tufan Colak
    Paul A. Higgins
    Peter T. Gallagher
    D. Shaun Bloomfield
    Solar Physics, 2013, 283 : 157 - 175
  • [10] Solar Flare Prediction Using Advanced Feature Extraction, Machine Learning, and Feature Selection
    Ahmed, Omar W.
    Qahwaji, Rami
    Colak, Tufan
    Higgins, Paul A.
    Gallagher, Peter T.
    Bloomfield, D. Shaun
    SOLAR PHYSICS, 2013, 283 (01) : 157 - 175