Missing Data Estimation in High-Dimensional Datasets: A Swarm Intelligence-Deep Neural Network Approach

被引:11
|
作者
Leke, Collins [1 ]
Marwala, Tshilidzi [1 ]
机构
[1] Univ Johannesburg, Johannesburg, South Africa
关键词
Missing data; Deep learning; Swarm intelligence; High-dimensional data; Supervised learning; Unsupervised learning; IMPUTATION;
D O I
10.1007/978-3-319-41000-5_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we examine the problem of missing data in high-dimensional datasets by taking into consideration the Missing Completely at Random and Missing at Random mechanisms, as well as the Arbitrary missing pattern. Additionally, this paper employs a methodology based on Deep Learning and Swarm Intelligence algorithms in order to provide reliable estimates for missing data. The deep learning technique is used to extract features from the input data via an unsupervised learning approach by modeling the data distribution based on the input. This deep learning technique is then used as part of the objective function for the swarm intelligence technique in order to estimate the missing data after a supervised fine-tuning phase by minimizing an error function based on the interrelationship and correlation between features in the dataset. The investigated methodology in this paper therefore has longer running times, however, the promising potential outcomes justify the trade-off. Also, basic knowledge of statistics is presumed.
引用
收藏
页码:259 / 270
页数:12
相关论文
共 50 条
  • [11] Swarm Intelligence and Neural Network for Data Classification
    Ghanem, Waheed Ali H. M.
    Jantan, Aman
    2014 IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM COMPUTING AND ENGINEERING, 2014, : 196 - 201
  • [12] Contrastive learning enhanced deep neural network with serial regularization for high-dimensional tabular data
    Wu, Yao
    Zhu, Donghua
    Wang, Xuefeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [13] A Configurable Deep Network for High-Dimensional Clinical Trial Data
    O' Donoghue, Jim
    Roantree, Mark
    Van Boxtel, Martin
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [14] Minimax optimal estimation of high-dimensional sparse covariance matrices with missing data
    Qi, Xinyu
    Wang, Jinru
    Zeng, Xiaochen
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2022, 20 (06)
  • [15] Missing data in interactive high-dimensional data visualization
    Swayne, DF
    Buja, A
    COMPUTATIONAL STATISTICS, 1998, 13 (01) : 15 - 26
  • [16] Selecting The Appropriate Data Sampling Approach for Imbalanced and High-Dimensional Bioinformatics Datasets
    Dittman, David J.
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2014, : 304 - 310
  • [17] Aligned deep neural network for integrative analysis with high-dimensional input
    Zhang, Shunqin
    Zhang, Sanguo
    Yi, Huangdi
    Ma, Shuangge
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 144
  • [18] High-dimensional covariance matrix estimation with missing observations
    Lounici, Karim
    BERNOULLI, 2014, 20 (03) : 1029 - 1058
  • [19] Estimation Method Based on Deep Neural Network for Consecutively Missing Sensor Data
    Liu F.
    Li H.
    Yang Z.
    Radioelectronics and Communications Systems, 2018, 61 (6) : 258 - 266
  • [20] Publishing Private High-dimensional Datasets: A Topological Approach
    Alipourjeddi, Narges
    Miri, Ali
    2022 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2022, : 1142 - 1147