Crohn's Disease Prediction Using Sequence Based Machine Learning Analysis of Human Microbiome

被引:3
|
作者
Unal, Metehan [1 ]
Bostanci, Erkan [1 ]
Ozkul, Ceren [2 ]
Acici, Koray [3 ]
Asuroglu, Tunc [4 ]
Guzel, Mehmet Serdar [1 ]
机构
[1] Ankara Univ, Dept Comp Engn, TR-06830 Ankara, Turkiye
[2] Hacettepe Univ, Fac Pharm, Dept Pharmaceut Microbiol, TR-06110 Ankara, Turkiye
[3] Ankara Univ, Dept Artificial Intelligence & Data Engn, TR-06830 Ankara, Turkiye
[4] Tampere Univ, Fac Med & Hlth Technol, FI-33720 Tampere, Finland
关键词
microbiota; Machine Learning; bowel disease; bioinformatics; ALGORITHMS;
D O I
10.3390/diagnostics13172835
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Human microbiota refers to the trillions of microorganisms that inhabit our bodies and have been discovered to have a substantial impact on human health and disease. By sampling the microbiota, it is possible to generate massive quantities of data for analysis using Machine Learning algorithms. In this study, we employed several modern Machine Learning techniques to predict Inflammatory Bowel Disease using raw sequence data. The dataset was obtained from NCBI preprocessed graph representations and converted into a structured form. Seven well-known Machine Learning frameworks, including Random Forest, Support Vector Machines, Extreme Gradient Boosting, Light Gradient Boosting Machine, Gaussian Naive Bayes, Logistic Regression, and k-Nearest Neighbor, were used. Grid Search was employed for hyperparameter optimization. The performance of the Machine Learning models was evaluated using various metrics such as accuracy, precision, fscore, kappa, and area under the receiver operating characteristic curve. Additionally, Mc Nemar's test was conducted to assess the statistical significance of the experiment. The data was constructed using k-mer lengths of 3, 4 and 5. The Light Gradient Boosting Machine model overperformed over other models with 67.24%, 74.63% and 76.47% accuracy for k-mer lengths of 3, 4 and 5, respectively. The LightGBM model also demonstrated the best performance in each metric. The study showed promising results predicting disease from raw sequence data. Finally, Mc Nemar's test results found statistically significant differences between different Machine Learning approaches.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] IoT-based disease prediction using machine learning
    Siddiqui, Salman Ahmad
    Ahmad, Anwar
    Fatima, Neda
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 108
  • [22] MACHINE LEARNING BASED PREDICTION OF INCIDENT CASES OF CROHN'S DISEASE USING ELECTRONIC HEALTH RECORDS FROM A LARGE INTEGRATED HEALTH SYSTEM
    Hugo, Julian
    Ibing, Susanne
    Borchert, Florian
    Sachs, Jan P.
    Ungaro, Ryan C.
    Bottinger, Erwin P.
    GASTROENTEROLOGY, 2023, 164 (06) : S1170 - S1170
  • [23] Machine Learning Based Prediction of Incident Cases of Crohn's Disease Using Electronic Health Records from a Large Integrated Health System
    Hugo, Julian
    Ibing, Susanne
    Borchert, Florian
    Sachs, Jan Philipp
    Cho, Judy
    Ungaro, Ryan C.
    Boettinger, Erwin P.
    ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2023, 2023, 13897 : 293 - 302
  • [24] Spaciotemporal machine learning analysis of complete small bowel capsule endoscopy videos for prediction of outcomes in Crohn's disease
    Kellerman, R.
    Bleiweiss, A.
    Samuel, S.
    Barzilay, O.
    Yehuda, R. Margalit
    ZImlichman, E.
    Eliakim, R.
    Ben-Horin, S.
    Klang, E.
    Kopylov, U.
    JOURNAL OF CROHNS & COLITIS, 2022, 16 : I308 - I309
  • [25] Machine Learning-based Prediction for Early Progression in Korean Crohn's disease: Results from the IMPACT Study
    Chun, J.
    Suji, K.
    Sung, A. Kwang
    Kyung, P. Soo
    Sangsoo, K.
    Dong, P., II
    JOURNAL OF CROHNS & COLITIS, 2022, 16 : I596 - I596
  • [26] Machine Learning-based Prediction for Early Progression in Korean Crohn's disease: Results from the IMPACT Study
    Chun, J.
    Suji, K.
    Sung, A. Kwang
    Kyung, P. Soo
    Sangsoo, K.
    Il, P. Dong
    JOURNAL OF CROHNS & COLITIS, 2022, 16 : I596 - I596
  • [27] A study on a hybrid water quality prediction model using sequence to sequence learning based LSTM And machine learning
    Yoon, Sukmin
    Shin, Jaeho
    Park, No-Suk
    Kweon, Minjae
    Kim, Youngsoon
    DESALINATION AND WATER TREATMENT, 2024, 320
  • [28] Diagnosis of Crohn’s disease and ulcerative colitis using the microbiome
    Da-Yeon Kang
    Jong-Lyul Park
    Min-Kyung Yeo
    Sang-Bum Kang
    Jin-Man Kim
    Ju Seok Kim
    Seon-Young Kim
    BMC Microbiology, 23
  • [29] Diagnosis of Crohn's disease and ulcerative colitis using the microbiome
    Kang, Da-Yeon
    Park, Jong-Lyul
    Yeo, Min-Kyung
    Kang, Sang-Bum
    Kim, Jin-Man
    Kim, Ju Seok
    Kim, Seon-Young
    BMC MICROBIOLOGY, 2023, 23 (01)
  • [30] Discrete sequence prediction using machine learning methods
    Sharif, H
    Conner, M
    IC-AI '04 & MLMTA'04 , VOL 1 AND 2, PROCEEDINGS, 2004, : 1097 - 1101