3D-MICE: integration of cross-sectional and longitudinal imputation for multi-analyte longitudinal clinical data

被引:73
|
作者
Luo, Yuan [1 ]
Szolovits, Peter [2 ]
Dighe, Anand S. [3 ,4 ]
Baron, Jason M. [3 ,4 ]
机构
[1] Northwestern Univ, Dept Prevent Med, Chicago, IL 60611 USA
[2] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[3] Massachusetts Gen Hosp, Dept Pathol, Boston, MA 02114 USA
[4] Harvard Med Sch, Boston, MA USA
关键词
machine learning; imputation; missing data; electronic health record; EHR; multiple imputation with chained equations; Gaussian process; computational pathology; data mining; MISSING VALUE IMPUTATION; MULTIPLE IMPUTATION; SURVIVAL ANALYSIS; VALUES;
D O I
10.1093/jamia/ocx133
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: A key challenge in clinical data mining is that most clinical datasets contain missing data. Since many commonly used machine learning algorithms require complete datasets (no missing data), clinical analytic approaches often entail an imputation procedure to "fill in" missing data. However, although most clinical datasets contain a temporal component, most commonly used imputation methods do not adequately accommodate longitudinal time-based data. We sought to develop a new imputation algorithm, 3-dimensional multiple imputation with chained equations (3D-MICE), that can perform accurate imputation of missing clinical time series data. Methods: We extracted clinical laboratory test results for 13 commonly measured analytes (clinical laboratory tests). We imputed missing test results for the 13 analytes using 3 imputation methods: multiple imputation with chained equations (MICE), Gaussian process (GP), and 3D-MICE. 3D-MICE utilizes both MICE and GP imputation to integrate cross-sectional and longitudinal information. To evaluate imputation method performance, we randomly masked selected test results and imputed these masked results alongside results missing from our original data. We compared predicted results to measured results for masked data points. Results: 3D-MICE performed significantly better than MICE and GP-based imputation in a composite of all 13 analytes, predicting missing results with a normalized root-mean-square error of 0.342, compared to 0.373 for MICE alone and 0.358 for GP alone. Conclusions: 3D-MICE offers a novel and practical approach to imputing clinical laboratory time series data. 3D-MICE may provide an additional tool for use as a foundation in clinical predictive analytics and intelligent clinical decision support.
引用
收藏
页码:645 / 653
页数:9
相关论文
共 50 条
  • [41] The clinical assessment of the cross-sectional longitudinal elasticity distribution of plaque in the carotid artery
    Koiwa, Y
    Katoh, G
    Okimoto, H
    Ishigaki, Y
    Oka, Y
    Seiji, K
    Satoh, K
    Katoh, M
    Hagiwara, H
    Tan-naka, Y
    CIRCULATION, 2005, 112 (17) : U488 - U488
  • [42] Clinical characteristics in patients with ossification of the posterior longitudinal ligament: A prospective multi-institutional cross-sectional study
    Hirai, Takashi
    Yoshii, Toshitaka
    Ushio, Shuta
    Mori, Kanji
    Maki, Satoshi
    Katsumi, Keiichi
    Nagoshi, Narihito
    Takeuchi, Kazuhiro
    Furuya, Takeo
    Watanabe, Kei
    Nishida, Norihiro
    Watanabe, Kota
    Kaito, Takashi
    Kato, Satoshi
    Nagashima, Katsuya
    Koda, Masao
    Ito, Kenyu
    Imagama, Shiro
    Matsuoka, Yuji
    Wada, Kanichiro
    Kimura, Atsushi
    Ohba, Tetsuro
    Katoh, Hiroyuki
    Matsuyama, Yukihiro
    Ozawa, Hiroshi
    Haro, Hirotaka
    Takeshita, Katsushi
    Watanabe, Masahiko
    Matsumoto, Morio
    Nakamura, Masaya
    Yamazaki, Masashi
    Okawa, Atsushi
    Kawaguchi, Yoshiharu
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [43] Clinical characteristics in patients with ossification of the posterior longitudinal ligament: A prospective multi-institutional cross-sectional study
    Takashi Hirai
    Toshitaka Yoshii
    Shuta Ushio
    Kanji Mori
    Satoshi Maki
    Keiichi Katsumi
    Narihito Nagoshi
    Kazuhiro Takeuchi
    Takeo Furuya
    Kei Watanabe
    Norihiro Nishida
    Kota Watanabe
    Takashi Kaito
    Satoshi Kato
    Katsuya Nagashima
    Masao Koda
    Kenyu Ito
    Shiro Imagama
    Yuji Matsuoka
    Kanichiro Wada
    Atsushi Kimura
    Tetsuro Ohba
    Hiroyuki Katoh
    Yukihiro Matsuyama
    Hiroshi Ozawa
    Hirotaka Haro
    Katsushi Takeshita
    Masahiko Watanabe
    Morio Matsumoto
    Masaya Nakamura
    Masashi Yamazaki
    Atsushi Okawa
    Yoshiharu Kawaguchi
    Scientific Reports, 10
  • [44] STATISTICAL-METHODS FOR ANALYSES OF INCOMPLETE CLINICAL SERVICE RECORDS - CONCURRENT USE OF LONGITUDINAL AND CROSS-SECTIONAL DATA
    LEAF, RC
    DIGIUSEPPE, R
    MASS, R
    ALINGTON, DE
    JOURNAL OF CONSULTING AND CLINICAL PSYCHOLOGY, 1993, 61 (03) : 495 - 505
  • [45] Cross-Sectional and Longitudinal Natural History of CLN3 Disease Progression
    Masten, Margaux C.
    Vermilion, Jennifer
    Adams, Heather R.
    Vierhile, Amy
    Marshall, Frederick J.
    Beck, Christopher A.
    Mink, Jonathan W.
    Augustine, Erika F.
    ANNALS OF NEUROLOGY, 2020, 88 : S213 - S213
  • [46] Cross-sectional and Longitudinal Natural History of CLN3 Disease Progression
    Masten, Margaux
    Vermilion, Jennifer
    Adams, Heather
    Vierhile, Amy
    Marshall, Frederick
    Beck, Christopher
    Mink, Jonathan
    Augustine, Erika
    NEUROLOGY, 2020, 94 (15)
  • [47] Cross-sectional data underestimates longitudinal changes in body composition of older individuals.
    Beckett, T
    Ades, PA
    Poehlman, ET
    OBESITY RESEARCH, 2000, 8 : 107S - 107S
  • [48] Exploiting the interplay between cross-sectional and longitudinal data in Class III malocclusion patients
    Enrico Barelli
    Ennio Ottaviani
    Pietro Auconi
    Guido Caldarelli
    Veronica Giuntini
    James A. McNamara
    Lorenzo Franchi
    Scientific Reports, 9
  • [49] MODELING INCOMPLETE LONGITUDINAL AND CROSS-SECTIONAL DATA USING LATENT GROWTH STRUCTURAL MODELS
    MCARDLE, JJ
    HAMAGAMI, F
    EXPERIMENTAL AGING RESEARCH, 1992, 18 (3-4) : 145 - 166
  • [50] Workplace Bullying and Mental Health: A Meta-Analysis on Cross-Sectional and Longitudinal Data
    Verkuil, Bart
    Atasayi, Serpil
    Molendijk, Marc L.
    PLOS ONE, 2015, 10 (08):