A Study of High-Dimensional Data Imputation Using Additive LASSO Regression Model

被引:4
|
作者
Lavanya, K. [1 ]
Reddy, L. S. S. [2 ]
Reddy, B. Eswara [3 ]
机构
[1] JNTUA, Dept Comp Sci & Engn, Anantapur 515822, Andhra Pradesh, India
[2] KLU, Dept Comp Sci & Engn, Guntur 522502, Andhra Pradesh, India
[3] JNTUA, Dept Comp Sci, Anantapur 517234, Andhra Pradesh, India
关键词
High-dimensional data; Multiple imputations; Regression; Missing data; MULTIPLE IMPUTATION; MISSING-DATA; METAANALYSIS; HETEROGENEITY;
D O I
10.1007/978-981-10-8055-5_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid growth of computational domains, bioinformatics finance, engineering, biometrics, and neuroimaging emphasize the necessity for analyzing high-dimensional data. Many real-world datasets may contain hundreds or thousands of features. The common problem in most of the knowledge-based classification problems is quality and quantity of data. In general, the common problem with many high-dimensional data samples is that it contains missing or unknown attribute values, incomplete feature vectors, and uncertain or vague data which have to be handled carefully. Due to the presence of a large segment of missing values in the datasets, refined multiple imputation methods are required to estimate the missing values so that a fair and more consistent analysis can be achieved. In this paper, three imputation (MI) methods, mean, imputations predictive mean, and imputations by additive LASSO, are employed in cloud. Results show that imputations by additive LASSO are the preferred multiple imputation (MI) method.
引用
收藏
页码:19 / 30
页数:12
相关论文
共 50 条
  • [1] LASSO Isotone for High-Dimensional Additive Isotonic Regression
    Fang, Zhou
    Meinshausen, Nicolai
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2012, 21 (01) : 72 - 91
  • [2] An explainable fused lasso regression model for handling high-dimensional fuzzy data
    Hesamian, Gholamreza
    Johannssen, Arne
    Chukhrova, Nataliya
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2024, 441
  • [3] EFFICIENT FUNCTIONAL LASSO KERNEL SMOOTHING FOR HIGH-DIMENSIONAL ADDITIVE REGRESSION
    Lee, Eun Ryung
    Park, Seyoung
    Mammen, Enno
    Park, Byeong U.
    ANNALS OF STATISTICS, 2024, 52 (04): : 1741 - 1773
  • [4] Localized Lasso for High-Dimensional Regression
    Yamada, Makoto
    Takeuchi, Koh
    Iwata, Tomoharu
    Shawe-Taylor, John
    Kaski, Samuel
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54 : 325 - 333
  • [5] The joint lasso: high-dimensional regression for group structured data
    Dondelinger, Frank
    Mukherjee, Sach
    BIOSTATISTICS, 2020, 21 (02) : 219 - 235
  • [6] High-dimensional additive hazards models and the Lasso
    Gaiffas, Stephane
    Guilloux, Agathe
    ELECTRONIC JOURNAL OF STATISTICS, 2012, 6 : 522 - 546
  • [7] Influence Diagnostics for High-Dimensional Lasso Regression
    Rajaratnam, Bala
    Roberts, Steven
    Sparks, Doug
    Yu, Honglin
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2019, 28 (04) : 877 - 890
  • [8] Nonparametric Additive Regression for High-Dimensional Group Testing Data
    Zuo, Xinlei
    Ding, Juan
    Zhang, Junjian
    Xiong, Wenjun
    MATHEMATICS, 2024, 12 (05)
  • [9] DOUBLY PENALIZED ESTIMATION IN ADDITIVE REGRESSION WITH HIGH-DIMENSIONAL DATA
    Tan, Zhiqiang
    Zhang, Cun-Hui
    ANNALS OF STATISTICS, 2019, 47 (05): : 2567 - 2600
  • [10] Missing Data Imputation with High-Dimensional Data
    Brini, Alberto
    van den Heuvel, Edwin R.
    AMERICAN STATISTICIAN, 2024, 78 (02): : 240 - 252