Distributed learning on 20 000+ lung cancer patients - The Personal Health Train

Cited by: 89
Authors
Deist, Timo M. [1, 2]
Dankers, Frank J. W. M. [1, 3]
Ojha, Priyanka [4]
Marshall, M. Scott [4]
Janssen, Tomas [4]
Faivre-Finn, Corinne [5]
Masciocchi, Carlotta [7]
Valentini, Vincenzo [6, 7]
Wang, Jiazhou [8]
Chen, Jiayan [8]
Zhang, Zhen [8]
Spezi, Emiliano [9, 10]
Button, Mick [10]
Nuyttens, Joost Jan [1, 11]
Vernhout, Rene [11]
van Soest, Johan
Jochems, Arthur [2]
Monshouwer, Rene [3]
Bussink, Johan [3]
Price, Gareth [5]
Lambin, Philippe [2]
Dekker, Andre [1]
Affiliations
[1] Maastricht Univ Med Ctr, GROW Sch Oncol & Dev Biol, Dept Radiat Oncol MAASTRO, Maastricht, Netherlands
[2] Maastricht Univ Med Ctr, GROW Sch Oncol & Dev Biol, D Lab Dept Precis Med, Maastricht, Netherlands
[3] Radboud Univ Nijmegen, Med Ctr, Dept Radiat Oncol, Nijmegen, Netherlands
[4] Netherlands Canc Inst Antoni van Leeuwenhoek, Dept Radiat Oncol, Amsterdam, Netherlands
[5] Univ Manchester, Manchester Acad Hlth Sci Ctr, Christie NHS Fdn Trust, Manchester, Lancs, England
[6] Univ Cattolica Sacro Cuore, Milan, Italy
[7] Fdn Policlin Univ A Gemelli IRCCS, Rome, Italy
[8] Fudan Univ, Shanghai Canc Ctr, Dept Radiat Oncol, Dept Oncol, Shanghai Med Coll, Shanghai, Peoples R China
[9] Cardiff Univ, Sch Engn, Cardiff, Wales
[10] Velindre Canc Ctr, Cardiff, Wales
[11] Erasmus MC, Canc Inst, Dept Radiat Oncol, Rotterdam, Netherlands
Funding
EU Horizon 2020
Keywords
Lung cancer; Big data; Distributed learning; Federated learning; Machine learning; Survival analysis; Prediction modeling; FAIR data; CARE
DOI
10.1016/j.radonc.2019.11.019
Chinese Library Classification number
R73 [Oncology]
Discipline classification code
100214
Abstract
Background and purpose: Access to healthcare data is indispensable for scientific progress and innovation. Sharing healthcare data is time-consuming and notoriously difficult due to privacy and regulatory concerns. The Personal Health Train (PHT) provides a privacy-by-design infrastructure connecting FAIR (Findable, Accessible, Interoperable, Reusable) data sources and allows distributed data analysis and machine learning. Patient data never leaves a healthcare institute.

Materials and methods: Lung cancer patient-specific databases (tumor staging and post-treatment survival information) of oncology departments were translated according to a FAIR data model and stored locally in a graph database. Software was installed locally to enable deployment of distributed machine learning algorithms via a central server. Algorithms (MATLAB, code and documentation publicly available) are patient privacy-preserving as only summary statistics and regression coefficients are exchanged with the central server. A logistic regression model to predict post-treatment two-year survival was trained and evaluated by receiver operating characteristic curves (ROC), root mean square prediction error (RMSE) and calibration plots.

Results: In 4 months, we connected databases with 23 203 patient cases across 8 healthcare institutes in 5 countries (Amsterdam, Cardiff, Maastricht, Manchester, Nijmegen, Rome, Rotterdam, Shanghai) using the PHT. Summary statistics were computed across databases. A distributed logistic regression model predicting post-treatment two-year survival was trained on 14 810 patients treated between 1978 and 2011 and validated on 8 393 patients treated between 2012 and 2015.

Conclusion: The PHT infrastructure demonstrably overcomes patient privacy barriers to healthcare data sharing and enables fast data analyses across multiple institutes from different countries with different regulatory regimens. This infrastructure promotes global evidence-based medicine while prioritizing patient privacy.

(C) 2019 The Authors. Published by Elsevier B.V.
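
The abstract states that only summary statistics and regression coefficients travel between each institute and the central server. The sketch below illustrates one way such an exchange could work for logistic regression. It is not the authors' published MATLAB implementation, and the abstract does not say which optimization scheme they used. Here each site returns only a summed gradient and a row count, and the server averages them. All function names, the single "stage" feature, and the synthetic data are hypothetical.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def local_gradient(beta, rows):
    """Runs inside one institute; returns only a summed gradient and a row count."""
    grad = [0.0] * len(beta)
    for x, y in rows:
        p = sigmoid(sum(b * xi for b, xi in zip(beta, x)))
        for j, xi in enumerate(x):
            grad[j] += (p - y) * xi
    return grad, len(rows)

def federated_fit(sites, n_features, lr=0.5, n_rounds=500):
    """Central server loop: send coefficients out, aggregate per-site summaries."""
    beta = [0.0] * n_features
    for _ in range(n_rounds):
        total = [0.0] * n_features
        n = 0
        for rows in sites:  # in a real deployment each call would run remotely
            g, m = local_gradient(beta, rows)
            total = [t + gi for t, gi in zip(total, g)]
            n += m
        # Gradient descent step on the average log-loss over all sites
        beta = [b - lr * t / n for b, t in zip(beta, total)]
    return beta

def make_site(n, rng):
    """Synthetic stand-in for one local database (assumption, not study data)."""
    rows = []
    for _ in range(n):
        stage = rng.uniform(-1, 1)           # hypothetical standardized feature
        p = sigmoid(0.5 - 2.0 * stage)       # hypothetical true survival model
        y = 1 if rng.random() < p else 0     # 1 = alive at two years
        rows.append(([1.0, stage], y))       # leading 1.0 is the intercept term
    return rows

rng = random.Random(0)
sites = [make_site(n, rng) for n in (300, 500, 200)]

beta_fed = federated_fit(sites, n_features=2)

# Reference fit on pooled rows, which a real deployment would never have access to
pooled = [row for rows in sites for row in rows]
beta_pooled = federated_fit([pooled], n_features=2)
```

Because the summed gradients over the sites equal the gradient over the pooled rows, the federated coefficients match a pooled fit up to floating-point rounding, even though no patient-level row leaves its site in the federated loop.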
Pages: 189-200
Number of pages: 12