Variable selection for nonlinear dimensionality reduction of biological datasets through bootstrapping of correlation networks

被引:0
|
作者
Aragones, David G. [1 ,2 ]
Palomino-Segura, Miguel [3 ,4 ,5 ]
Sicilia, Jon [3 ]
Crainiciuc, Georgiana [3 ]
Ballesteros, Ivan [3 ]
Sanchez-Cabo, Fatima [6 ]
Hidalgo, Andres [7 ,8 ]
Calvo, Gabriel F. [1 ,2 ]
机构
[1] Univ Castilla La Mancha, Dept Math, Ciudad Real, Spain
[2] Univ Castilla La Mancha, MOLAB Math Oncol Lab, Ciudad Real, Spain
[3] Ctr Nacl Invest Cardiovasc Carlos III, Area Cell & Dev Biol, Madrid, Spain
[4] Inst Univ Invest Biosanitaria Extremadura INUBE, Immunophysiol Res Grp, Badajoz, Spain
[5] Univ Extremadura, Fac Sci, Dept Physiol, Badajoz, Spain
[6] Ctr Nacl Invest Cardiovasc Carlos III, Bioinformat Unit, Madrid, Spain
[7] Yale Univ, Sch Med, Vasc Biol & Therapeut Program, New Haven, CT USA
[8] Yale Univ, Dept Immunobiol, Sch Med, New Haven, CT USA
关键词
Artificial intelligence; Machine learning; Unsupervised learning; Feature selection; UMAP; Complex systems; SEQ DATA; CELL; MODEL;
D O I
10.1016/j.compbiomed.2023.107827
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying the most relevant variables or features in massive datasets for dimensionality reduction can lead to improved and more informative display, faster computation times, and more explainable models of complex systems. Despite significant advances and available algorithms, this task generally remains challenging, especially in unsupervised settings. In this work, we propose a method that constructs correlation networks using all intervening variables and then selects the most informative ones based on network bootstrapping. The method can be applied in both supervised and unsupervised scenarios. We demonstrate its functionality by applying Uniform Manifold Approximation and Projection for dimensionality reduction to several highdimensional biological datasets, derived from 4D live imaging recordings of hundreds of morpho-kinetic variables, describing the dynamics of thousands of individual leukocytes at sites of prominent inflammation. We compare our method with other standard ones in the field, such as Principal Component Analysis and Elastic Net, showing that it outperforms them. The proposed method can be employed in a wide range of applications, encompassing data analysis and machine learning.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Efficient Recognition of Ictal Activities in EEG through Correlation Based Dimensionality Reduction
    Nara, Sanjeev
    Swami, Piyush
    Bhatia, Manvir
    Panigrahi, Bijaya K.
    Gandhi, Tapan
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 2547 - 2550
  • [32] Dimensionality Reduction and Variable Selection in Multivariate Varying-Coefficient Models With a Large Number of Covariates
    He, Kejun
    Lian, Heng
    Ma, Shujie
    Huang, Jianhua Z.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (522) : 746 - 754
  • [33] Filter Variable Selection Algorithm Using Risk Ratios for Dimensionality Reduction of Healthcare Data for Classification
    Bodur, Ersin Kuset
    Atsa'am, Donald Douglas
    PROCESSES, 2019, 7 (04)
  • [34] Online dimensionality reduction through stacked generalization of spectral methods with deep networks
    Alvarado-Perez, Juan Carlos
    Garcia, Miguel Angel
    Puig, Domenec
    MACHINE LEARNING, 2025, 114 (05)
  • [35] VARIABLE SELECTION IN NONLINEAR MODELING BASED ON RBF NETWORKS AND EVOLUTIONARY COMPUTATION
    Patrinos, Panagiotis
    Alexandridis, Alex
    Ninos, Konstantinos
    Sarimveis, Haralambos
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2010, 20 (05) : 365 - 379
  • [36] Prediction of petrophysical static rock type through nonlinear dimensionality reduction and mutual information
    Tian, Haitao
    Huang, Lei
    Zhang, Ke
    EARTH SCIENCE INFORMATICS, 2025, 18 (01)
  • [37] Pinning control of networks: Dimensionality reduction through simultaneous block-diagonalization of matrices
    Panahi, Shirin
    Lodi, Matteo
    Storace, Marco
    Sorrentino, Francesco
    CHAOS, 2022, 32 (11)
  • [38] Self-Healing Framework for Next-Generation Networks through Dimensionality Reduction
    Palacios, David
    Fortes, Sergio
    de-la-Bandera, Isabel
    Barco, Raquel
    IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (07) : 170 - 176
  • [39] Nonlinear dimensionality reduction method of scheduling frequent information in wireless networks based on multilevel mapping
    Jian-zhao Sun
    Kun Yang
    Marcin Woźniak
    Wireless Networks, 2023, 29 : 2897 - 2907
  • [40] A co-kurtosis PCA based dimensionality reduction with nonlinear reconstruction using neural networks
    Nayak, Dibyajyoti
    Jonnalagadda, Anirudh
    Balakrishnan, Uma
    Kolla, Hemanth
    Aditya, Konduri
    COMBUSTION AND FLAME, 2024, 259