Predicting osteoarthritis in adults using statistical data mining and machine learning

被引:5
|
作者
Bertoncelli, Carlo M. [1 ,2 ,3 ]
Altamura, Paola [4 ]
Bagui, Sikha [1 ]
Bagui, Subhash [1 ]
Vieira, Edgar Ramos [5 ]
Costantini, Stefania [3 ]
Monticone, Marco [6 ,7 ,8 ]
Solla, Federico [2 ]
Bertoncelli, Domenico [1 ,3 ]
机构
[1] Univ West Florida, Hal Marcus Coll Sci & Engn, Dept Comp Sci, Pensacola, FL 32514 USA
[2] Lenval Univ, Pediat Hosp Nice, Dept Pediat Orthopaed Surg, Nice, France
[3] Univ Aquila, Dept Informat Engn Comp Sci & Math, Laquila, Italy
[4] Univ G dAnnunzio, Dept Med Chem & Pharmaceut Technol, Chieti, Italy
[5] Florida Int Univ, Dept Phys Therapy, Miami, FL 33199 USA
[6] Univ Cagliari, Dept Med Sci & Publ Hlth, Cagliari, Italy
[7] Univ Cagliari, Dept Phys Med & Rehabil, Cagliari, Italy
[8] Univ Cagliari, G Brotzu Hosp, Dept Neurosci & Rehabil, Neurorehabil Unit, Cagliari, Italy
关键词
arthritis; machine learning; osteoarthritis; statistical data mining; KNEE OSTEOARTHRITIS; EPIDEMIOLOGY; MODEL; VALIDATION; SCOLIOSIS; ARTHRITIS; BURDEN; STATE; HIP;
D O I
10.1177/1759720X221104935
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Osteoarthritis (OA) has traditionally been considered a disease of older adults (>= 65 years old), but it may appear in younger adults. However, the risk factors for OA in younger adults need to be further evaluated. Objectives: To develop a prediction model for identifying risk factors of OA in subjects aged 20-50 years and compare the performance of different machine learning models. Methods: We included data from 52,512 participants of the National Health and Nutrition Examination Survey; of those, we analyzed only subjects aged 20-50 years (n = 19,133), with or without OA. The supervised machine learning model 'Deep PredictMed' based on logistic regression, deep neural network (DNN), and support vector machine was used for identifying demographic and personal characteristics that are associated with OA. Finally, we compared the performance of the different models. Results: Being a female (p < 0.001), older age (p < 0.001), a smoker (p < 0.001), higher body mass index (p < 0.001), high blood pressure (p < 0.001), race/ethnicity (lowest risk among Mexican Americans, p = 0.01), and physical and mental limitations (p < 0.001) were associated with having OA. Best predictive performance yielded a 75% area under the receiver operating characteristic curve. Conclusion: Sex (female), age (older), smoking (yes), body mass index (higher), blood pressure (high), race/ethnicity, and physical and mental limitations are risk factors for having OA in adults aged 20-50 years. The best predictive performance was achieved using DNN algorithms.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Predicting Malicious Software in IoT Environment Based on Machine Learning and Data Mining Techniques
    Alharbi, Abdulmohsen
    Hamid, Abdul
    Lahza, Husam
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (08) : 497 - 506
  • [42] Predicting the Dynamic Behaviour of a Concrete Dam using Statistical and Machine Learning Models
    Pereira, Sérgio
    Mata, Juan
    Magalhães, Filipe
    Gomes, Jorge
    Cunha, Álvaro
    e-Journal of Nondestructive Testing, 2024, 29 (07):
  • [43] Prediction of pain in knee osteoarthritis patients using machine learning: Data from Osteoarthritis Initiative
    Alexos, Antonios
    Kokkotis, Christos
    Moustakidis, Serafeim
    Papageorgiou, Elpiniki
    2020 11TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA 2020), 2020, : 240 - 246
  • [44] Predicting data science performance from log data: using machine learning
    Doleck, Tenzin
    Agand, Pedram
    Pirrotta, Dylan
    EDUCATION AND INFORMATION TECHNOLOGIES, 2025,
  • [45] Predicting Amyloid Positivity in Cognitively Unimpaired Older Adults A Machine Learning Approach Using A4 Data
    Petersen, Kellen K.
    Lipton, Richard B.
    Grober, Ellen
    Davatzikos, Christos
    Sperling, Reisa A.
    Ezzati, Ali
    NEUROLOGY, 2022, 98 (24) : E2425 - E2435
  • [46] Predicting body weight through biometric measurements in growing hair sheep using data mining and machine learning algorithms
    Ignacio Vázquez-Martínez
    Cem Tırınk
    Rosario Salazar-Cuytun
    Jesus A. Mezo-Solis
    Ricardo A. Garcia Herrera
    José Felipe Orzuna-Orzuna
    Alfonso J. Chay-Canul
    Tropical Animal Health and Production, 2023, 55
  • [47] Predicting body weight through biometric measurements in growing hair sheep using data mining and machine learning algorithms
    Vazquez-Martinez, Ignacio
    Tirink, Cem
    Salazar-Cuytun, Rosario
    Mezo-Solis, Jesus A.
    Herrera, Ricardo A. Garcia
    Orzuna-Orzuna, Jose Felipe
    Chay-Canul, Alfonso J.
    TROPICAL ANIMAL HEALTH AND PRODUCTION, 2023, 55 (05)
  • [48] An adaptive correlation based video data mining using machine learning
    Mallikharjuna, L. K.
    Reddy, V. S. K.
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2020, 24 (01) : 1 - 9
  • [49] CLASSIFICATION OF FACIAL EXPRESSIONS USING DATA MINING AND MACHINE LEARNING ALGORITHMS
    Faria, Brigida Monica
    Lau, Nuno
    Reis, Luis Paulo
    SISTEMAS E TECHNOLOGIAS DE INFORMACAO: ACTAS DA 4A CONFERENCIA IBERICA DE SISTEMAS E TECNOLOGIAS DE LA INFORMACAO, 2009, : 197 - +
  • [50] Data Mining For Predicting Students' Learning Result
    Wati, Masna
    Indrawan, Wahyu
    Widians, Joan Angelina
    Puspitasari, Novianti
    2017 4TH INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS AND INFORMATION PROCESSING TECHNOLOGY (CAIPT), 2017, : 37 - 40