A joint learning method for incomplete and imbalanced data in electronic health record based on generative adversarial networks

被引:2
|
作者
Weng, Xutao [1 ]
Song, Hong [1 ]
Lin, Yucong [2 ]
Wu, You [3 ]
Zhang, Xi [1 ]
Liu, Bowen [3 ]
Yang, Jian [2 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
[3] Beijing Inst Technol, Sch Med Technol, Beijing 100081, Peoples R China
关键词
Electronic health records; Generative adversarial networks; Imbalanced learning; Missing values imputation; MISSING DATA; IMPUTATION; CLASSIFICATION;
D O I
10.1016/j.compbiomed.2023.107687
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Electronic health records (EHR), present challenges of incomplete and imbalanced data in clinical predictions. Previous studies addressed these two issues with two-step separately, which caused the decrease in the performance of prediction tasks. In this paper, we propose a unified framework to simultaneously addresses the challenges of incomplete and imbalanced data in EHR. Based on the framework, we develop a model called Missing Value Imputation and Imbalanced Learning Generative Adversarial Network (MVIIL-GAN). We use MVIIL-GAN to perform joint learning on the imputation process of high missing rate data and the conditional generation process of EHR data. The joint learning is achieved by introducing two discriminators to distinguish the fake data from the generated data at sample-level and variable-level. MVIIL-GAN integrate the missing values imputation and data generation in one step, improving the consistency of parameter optimization and the performance of prediction tasks. We evaluate our framework using the public dataset MIMIC-IV with high missing rates data and imbalanced data. Experimental results show that MVIIL-GAN outperforms existing methods in prediction performance. The implementation of MVIIL-GAN can be found at https://github.com/P eroxidess/MVIIL-GAN.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] An Intelligent Fault Diagnosis Method for Imbalanced Nuclear Power Plant Data Based on Generative Adversarial Networks
    Yuntao Dai
    Lizhang Peng
    Zhaobo Juan
    Yuan Liang
    Jihong Shen
    Shujuan Wang
    Sichao Tan
    Hongyan Yu
    Mingze Sun
    Journal of Electrical Engineering & Technology, 2023, 18 : 3237 - 3252
  • [12] An Intelligent Fault Diagnosis Method for Imbalanced Nuclear Power Plant Data Based on Generative Adversarial Networks
    Dai, Yuntao
    Peng, Lizhang
    Juan, Zhaobo
    Liang, Yuan
    Shen, Jihong
    Wang, Shujuan
    Tan, Sichao
    Yu, Hongyan
    Sun, Mingze
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2023, 18 (04) : 3237 - 3252
  • [13] A tutorial on generative adversarial networks with application to classification of imbalanced data
    Huang, Yuxiao
    Fields, Kara G.
    Ma, Yan
    STATISTICAL ANALYSIS AND DATA MINING, 2022, 15 (05) : 543 - 552
  • [14] Fault diagnosis method based on triple generative adversarial nets for imbalanced data
    Su, Changwei
    Wang, Xueren
    Liu, Ruijie
    Guo, Ziyi
    Sang, Shengtian
    Yu, Shuang
    Zhang, Haifeng
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (03)
  • [15] Imbalanced Learning for Fault Diagnosis Problem of Rotating Machinery Based on Generative Adversarial Networks
    Xie, Yuan
    Zhang, Tao
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 6017 - 6022
  • [16] Research on Imbalanced Data Classification Based on Classroom-Like Generative Adversarial Networks
    Lv, Yancheng
    Lin, Lin
    Liu, Jie
    Guo, Hao
    Tong, Changsheng
    NEURAL COMPUTATION, 2022, 34 (04) : 1045 - 1073
  • [17] A clustering and generative adversarial networks-based hybrid approach for imbalanced data classification
    Ding H.
    Cui X.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (06) : 8003 - 8018
  • [18] A Novel Method for Imbalanced Fault Diagnosis of Rotating Machinery Based on Generative Adversarial Networks
    Li, Zhenxiang
    Zheng, Taisheng
    Wang, Yang
    Cao, Zhi
    Guo, Zhiqi
    Fu, Hongyong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [19] Multiple Imputation by Generative Adversarial Networks for Classification with Incomplete Data
    Bao Ngoc Vi
    Dinh Tan Nguyen
    Cao Truong Tran
    Huu Phuc Ngo
    Chi Cong Nguyen
    Hai-Hong Phan
    2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 162 - 167
  • [20] A new imbalanced data oversampling method based on Bootstrap method and Wasserstein Generative Adversarial Network
    Hou, Binjie
    Chen, Gang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (03) : 4309 - 4327