Principal component analysis revisited: fast multitrait genetic evaluations with smooth convergence

被引:0
|
作者
Ahlinder, Jon [1 ]
Hall, David [1 ]
Suontama, Mari [1 ]
Sillanpaa, Mikko J. [2 ]
机构
[1] Skogforsk, Dept Tree Breeding, Box 3,Tomterna 1, SE-91821 Savar, Sweden
[2] Oulu Univ, Res Unit Math Sci, FI-90014 Oulu, Finland
来源
G3-GENES GENOMES GENETICS | 2024年 / 14卷 / 12期
关键词
PCA; Loblolly pine; Scots pine; BLUP; linear mixed-effect model; convergence; genetic correlation; Plant Genetics and Genomics; MIXED MODELS; R PACKAGE; GENOMIC SELECTION; ENVIRONMENT DATA; PREDICTION; TRAITS; PHENOTYPES; REGRESSION; VARIANCES; ACCURACY;
D O I
10.1093/g3journal/jkae228
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
A cornerstone in breeding and population genetics is the genetic evaluation procedure, needed to make important decisions on population management. Multivariate mixed model analysis, in which many traits are considered jointly, utilizes genetic and environmental correlations between traits to improve the accuracy. However, the number of parameters in the multitrait model grows exponentially with the number of traits which reduces its scalability. Here, we suggest using principal component analysis to reduce the dimensions of the response variables, and then using the computed principal components as separate responses in the genetic evaluation analysis. As principal components are orthogonal to each other so that phenotypic covariance is abscent between principal components, a full multivariate analysis can be approximated by separate univariate analyses instead which should speed up computations considerably. We compared the approach to both traditional multivariate analysis and factor analytic approach in terms of computational requirement and rank lists according to predicted genetic merit on two forest tree datasets with 22 and 27 measured traits, respectively. Obtained rank lists of the top 50 individuals were in good agreement. Interestingly, the required computational time of the approach only took a few seconds without convergence issues, unlike the traditional approach which required considerably more time to run (7 and 10 h, respectively). The factor analytic approach took approximately 5-10 min. Our approach can easily handle missing data and can be used with all available linear mixed effect model softwares as it does not require any specific implementation. The approach can help to mitigate difficulties with multitrait genetic analysis in both breeding and wild populations.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] qrpca: A package for fast principal component analysis with GPU acceleration
    S. de Souza, R.
    Quanfeng, X.
    Shen, S.
    Peng, C.
    Mu, Z.
    Astronomy and Computing, 2022, 41
  • [42] qrpca: A package for fast principal component analysis with GPU acceleration
    de Souza, R. S.
    Quanfeng, X.
    Shen, S.
    Peng, C.
    Mu, Z.
    ASTRONOMY AND COMPUTING, 2022, 41
  • [43] Smooth principal component analysis with application to functional magnetic resonance imaging
    Ulfarsson, Magnus O.
    Solo, Victor
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 2241 - 2244
  • [44] Selection of microbial biomarkers with genetic algorithm and principal component analysis
    Ping Zhang
    Nicholas P. West
    Pin-Yen Chen
    Mike W. C. Thang
    Gareth Price
    Allan W. Cripps
    Amanda J. Cox
    BMC Bioinformatics, 20
  • [45] Selection of microbial biomarkers with genetic algorithm and principal component analysis
    Zhang, Ping
    West, Nicholas P.
    Chen, Pin-Yen
    Thang, Mike W. C.
    Price, Gareth
    Cripps, Allan W.
    Cox, Amanda J.
    BMC BIOINFORMATICS, 2019, 20 (01)
  • [46] Feature selection using principal component analysis and genetic algorithm
    Adhao, Rahul
    Pachghare, Vinod
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2020, 23 (02): : 595 - 602
  • [47] Principal component analysis of event-related potentials: Misallocation of variance revisited
    Achim, A
    Marcantoni, W
    PSYCHOPHYSIOLOGY, 1997, 34 (05) : 597 - 606
  • [48] A fast encoding algorithm for vector quantization based on Principal Component Analysis
    Lee, Jiann-Der
    Chiou, Yaw-Hwang
    TENCON 2007 - 2007 IEEE REGION 10 CONFERENCE, VOLS 1-3, 2007, : 1413 - 1416
  • [49] A Fast Minimum Variance Beamforming Method Using Principal Component Analysis
    Kim, Kyuhong
    Park, Suhyun
    Kim, Jungho
    Park, Sung-Bae
    Bae, MooHo
    IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2014, 61 (06) : 930 - 945
  • [50] Fast Ridge Regression with Randomized Principal Component Analysis and Gradient Descent
    Lu, Yichao
    Foster, Dean P.
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2014, : 525 - 532