Multi-Label Clinical Time-Series Generation via Conditional GAN

被引:4
|
作者
Lu, Chang [1 ]
Reddy, Chandan K. [2 ]
Wang, Ping [1 ]
Nie, Dong [3 ]
Ning, Yue [1 ]
机构
[1] Stevens Inst Technol, Dept Comp Sci, Hoboken, NJ 07310 USA
[2] Virginia Tech, Dept Comp Sci, Arlington, VA 22203 USA
[3] Univ North Carolina Chapel Hill, Dept Comp Sci, Chapel Hill, NC 27599 USA
基金
美国国家科学基金会;
关键词
Diseases; Generative adversarial networks; Generators; Training; Task analysis; Synthetic data; Measurement; Electronic health records; generative adversarial network (GAN); time-series generation; imbalanced data;
D O I
10.1109/TKDE.2023.3310909
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep learning has been successfully adopted in a wide range of applications related to electronic health records (EHRs) such as representation learning and clinical event prediction. However, due to privacy constraints, limited access to EHR becomes a bottleneck for deep learning research. To mitigate these concerns, generative adversarial networks (GANs) have been successfully used for generating EHR data. However, there are still challenges in high-quality EHR generation, including generating time-series EHR data and imbalanced uncommon diseases. In this work, we propose a Multi-label Time-series GAN (MTGAN) to generate EHR and simultaneously improve the quality of uncommon disease generation. The generator of MTGAN uses a gated recurrent unit (GRU) with a smooth conditional matrix to generate sequences and uncommon diseases. The critic gives scores using Wasserstein distance to recognize real samples from synthetic samples by considering both data and temporal features. We also propose a training strategy to calculate temporal features for real data and stabilize GAN training. Furthermore, we design multiple statistical metrics and prediction tasks to evaluate the generated data. Experimental results demonstrate the quality of the synthetic data and the effectiveness of MTGAN in generating realistic sequential EHR data, especially for uncommon diseases.
引用
收藏
页码:1728 / 1740
页数:13
相关论文
共 50 条
  • [1] Comparing Multi-label Classification with Reinforcement Learning for Summarisation of Time-series Data
    Gkatzia, Dimitra
    Hastie, Helen
    Lemon, Oliver
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1231 - 1240
  • [2] On time series representations for multi-label NILM
    Christoforos Nalmpantis
    Dimitris Vrakas
    Neural Computing and Applications, 2020, 32 : 17275 - 17290
  • [3] On time series representations for multi-label NILM
    Nalmpantis, Christoforos
    Vrakas, Dimitris
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (23): : 17275 - 17290
  • [4] A Label Embedding Method via Conditional Covariance Maximization for Multi-label Classification
    Li, Dan
    Li, Yunqian
    Li, Jun
    Xu, Jianhua
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2023, PT II, 2023, 14147 : 393 - 407
  • [5] Multi-Label Conditional Generation From Pre-Trained Models
    Proszewska, Magdalena
    Wolczyk, Maciej
    Zieba, Maciej
    Wielopolski, Patryk
    Maziarka, Lukasz
    Smieja, Marek
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6185 - 6198
  • [6] Data reduction via multi-label prototype generation
    Ougiaroglou, Stefanos
    Filippakis, Panagiotis
    Fotiadou, Georgia
    Evangelidis, Georgios
    NEUROCOMPUTING, 2023, 526 : 1 - 8
  • [7] On the generation of multi-label prototypes
    Bello, Marilyn
    Napoles, Gonzalo
    Vanhoof, Koen
    Bello, Rafael
    INTELLIGENT DATA ANALYSIS, 2020, 24 (S1) : S167 - S183
  • [8] PluGeN: Multi-Label Conditional Generation from Pre-trained Models
    Wolczyk, Maciej
    Proszewska, Magdalena
    Maziarka, Lukasz
    Zieba, Maciej
    Wielopolski, Patryk
    Kurczab, Rafal
    Smieja, Marek
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8647 - 8656
  • [9] Text Recommendation Based on Time Series and Multi-label Information
    Yin, Yi
    Feng, Dan
    Shi, Zhan
    Ouyang, Lin
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2021, 18 (02) : 419 - 439
  • [10] Conditional Bernoulli Mixtures for Multi-Label Classification
    Li, Cheng
    Wang, Bingyu
    Pavlu, Virgil
    Aslam, Javed
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48