A structured iterative division approach for non-sparse regression models and applications in biological data analysis

被引:0
|
作者
Yu, Shun [1 ]
Yang, Yuehan [1 ]
机构
[1] Cent Univ Finance & Econ, Sch Stat & Math, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Non-sparse structure; biology problem; dividing strategy; coordinate descent; SELECTION;
D O I
10.1177/09622802241254251
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
In this paper, we focus on the modeling problem of estimating data with non-sparse structures, specifically focusing on biological data that exhibit a high degree of relevant features. Various fields, such as biology and finance, face the challenge of non-sparse estimation. We address the problems using the proposed method, called structured iterative division. Structured iterative division effectively divides data into non-sparse and sparse structures and eliminates numerous irrelevant variables, significantly reducing the error while maintaining computational efficiency. Numerical and theoretical results demonstrate the competitive advantage of the proposed method on a wide range of problems, and the proposed method exhibits excellent statistical performance in numerical comparisons with several existing methods. We apply the proposed algorithm to two biology problems, gene microarray datasets, and chimeric protein datasets, to the prognostic risk of distant metastasis in breast cancer and Alzheimer's disease, respectively. Structured iterative division provides insights into gene identification and selection, and we also provide meaningful results in anticipating cancer risk and identifying key factors.
引用
收藏
页码:1233 / 1248
页数:16
相关论文
共 39 条
  • [1] Non-sparse multiple kernel learning approach for support vector regression
    Hu, Qinghui
    Ding, Lixin
    Liu, Xiaogang
    Li, Zhaokui
    Sichuan Daxue Xuebao (Gongcheng Kexue Ban)/Journal of Sichuan University (Engineering Science Edition), 2015, 47 (04): : 91 - 97
  • [2] Applications of Multilevel Structured Additive Regression Models to Insurance Data
    Lang, Stefan
    Umlauf, Nikolaus
    COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 155 - 164
  • [3] Genome-Phenome Association Analysis of Complex Diseases a Structured Sparse Regression Approach
    Xing, Eric
    BIOINFORMATICS RESEARCH AND APPLICATIONS, 2011, 6674 : 11 - 11
  • [4] Sparse multivariate factor analysis regression models and its applications to integrative genomics analysis
    Zhou, Yan
    Wang, Pei
    Wang, Xianlong
    Zhu, Ji
    Song, Peter X. -K.
    GENETIC EPIDEMIOLOGY, 2017, 41 (01) : 70 - 80
  • [5] A Sparse Latent Regression Approach for Integrative Analysis of Glycomic and Glycotranscriptomic Data
    Wang, Xuefu
    Li, Sujun
    Peng, Wenjing
    Mechref, Yehia
    Tang, Haixu
    ACM-BCB' 2017: PROCEEDINGS OF THE 8TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY,AND HEALTH INFORMATICS, 2017, : 273 - 278
  • [6] Analysis of sparse data in logistic regression in medical research: A newer approach
    Devika, S.
    Jeyaseelan, L.
    Sebastian, G.
    JOURNAL OF POSTGRADUATE MEDICINE, 2016, 62 (01) : 26 - 31
  • [7] Sparse regularized low-rank tensor regression with applications in genomic data analysis
    Le Ou-Yang
    Zhang, Xiao-Fei
    Yan, Hong
    PATTERN RECOGNITION, 2020, 107
  • [8] Iterative approach of dual regression with a sparse prior enhances the performance of independent component analysis for group functional magnetic resonance imaging (fMRI) data
    Kim, Yong-Hwan
    Kim, Junghoe
    Lee, Jong-Hwan
    NEUROIMAGE, 2012, 63 (04) : 1864 - 1889
  • [9] REGRESSION BASED PRINCIPAL COMPONENT ANALYSIS FOR SPARSE FUNCTIONAL DATA WITH APPLICATIONS TO SCREENING GROWTH PATHS
    Zhang, Wenfei
    Wei, Ying
    ANNALS OF APPLIED STATISTICS, 2015, 9 (02): : 597 - 620
  • [10] Bivariate location-scale models for regression analysis, with applications to lifetime data
    He, WQ
    Lawless, JF
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2005, 67 : 63 - 78