Missing Data Estimation in High-Dimensional Datasets: A Swarm Intelligence-Deep Neural Network Approach

被引:11
|
作者
Leke, Collins [1 ]
Marwala, Tshilidzi [1 ]
机构
[1] Univ Johannesburg, Johannesburg, South Africa
关键词
Missing data; Deep learning; Swarm intelligence; High-dimensional data; Supervised learning; Unsupervised learning; IMPUTATION;
D O I
10.1007/978-3-319-41000-5_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we examine the problem of missing data in high-dimensional datasets by taking into consideration the Missing Completely at Random and Missing at Random mechanisms, as well as the Arbitrary missing pattern. Additionally, this paper employs a methodology based on Deep Learning and Swarm Intelligence algorithms in order to provide reliable estimates for missing data. The deep learning technique is used to extract features from the input data via an unsupervised learning approach by modeling the data distribution based on the input. This deep learning technique is then used as part of the objective function for the swarm intelligence technique in order to estimate the missing data after a supervised fine-tuning phase by minimizing an error function based on the interrelationship and correlation between features in the dataset. The investigated methodology in this paper therefore has longer running times, however, the promising potential outcomes justify the trade-off. Also, basic knowledge of statistics is presumed.
引用
收藏
页码:259 / 270
页数:12
相关论文
共 50 条
  • [1] A Deep Learning-Cuckoo Search Method for Missing Data Estimation in High-Dimensional Datasets
    Leke, Collins
    Ndjiongue, Alain Richard
    Twala, Bhekisipho
    Marwala, Tshilidzi
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2017, PT I, 2017, 10385 : 561 - 572
  • [2] Broad and deep neural network for high-dimensional data representation learning
    Feng, Qiying
    Liu, Zhulin
    Chen, C. L. Philip
    INFORMATION SCIENCES, 2022, 599 : 127 - 146
  • [3] Nonparametric Estimation for High-Dimensional Space Models Based on a Deep Neural Network
    Wang, Hongxia
    Jin, Xiao
    Wang, Jianian
    Hao, Hongxia
    MATHEMATICS, 2023, 11 (18)
  • [4] Deep Neural Fuzzy System Oriented toward High-Dimensional Data and Interpretable Artificial Intelligence
    Chen, Dewang
    Cai, Jijie
    Huang, Yunhu
    Lv, Yisheng
    APPLIED SCIENCES-BASEL, 2021, 11 (16):
  • [5] ENNS: Variable Selection, Regression, Classification and Deep Neural Network for High-Dimensional Data
    Yang, Kaixu
    Ganguli, Arkaprabha
    Maiti, Tapabrata
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [6] A method for learning a sparse classifier in the presence of missing data for high-dimensional biological datasets
    Severson, Kristen A.
    Monian, Brinda
    Love, J. Christopher
    Braatz, Richard D.
    BIOINFORMATICS, 2017, 33 (18) : 2897 - 2905
  • [7] Missing Data Imputation with High-Dimensional Data
    Brini, Alberto
    van den Heuvel, Edwin R.
    AMERICAN STATISTICIAN, 2024, 78 (02): : 240 - 252
  • [8] Deep Learning-Bat High-Dimensional Missing Data Estimator
    Leke, Collins
    Ndjiongue, A. R.
    Twala, Bhekisipho
    Marwala, Tshilidzi
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 483 - 488
  • [9] Optimal estimation of high-dimensional sparse covariance matrices with missing data
    Miao, Li
    Wang, Jinru
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2024,
  • [10] An immune approach to classifying the high-dimensional datasets
    Chmielewski, Andrzej
    Wierzchon, Slawomir T.
    2008 INTERNATIONAL MULTICONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (IMCSIT), VOLS 1 AND 2, 2008, : 79 - +