GEFormer: A genotype-environment interaction- based genomic prediction method that integrates the gating multilayer perceptron and linear attention mechanisms

被引:0
|
作者
Yao, Zhou [1 ,2 ,3 ]
Yao, Mengting [3 ]
Wang, Chuang [1 ,2 ,3 ]
Li, Ke [3 ]
Guo, Junhao [3 ]
Xiao, Yingjie [1 ,4 ]
Yan, Jianbing [1 ,4 ]
Liu, Jianxiao [1 ,2 ,3 ,4 ]
机构
[1] Huazhong Agr Univ, Natl Key Lab Crop Genet Improvement, Wuhan 430070, Peoples R China
[2] Huazhong Agr Univ, Hubei Key Lab Agr Bioinformat, Wuhan 430070, Peoples R China
[3] Huazhong Agr Univ, Coll Informat, Wuhan 430070, Peoples R China
[4] Hubei Hongshan Lab, Wuhan 430070, Peoples R China
关键词
genomic prediction; crop growth environment; genotype-environment interactions; gated MLP; linear attention mechanism; MODELS; PERFORMANCE; SELECTION; ACCURACY; GENETICS; IMPACTS;
D O I
10.1016/j.molp.2025.01.020
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The integration of genotypic and environmental data can enhance genomic prediction accuracy for crop field traits. Existing genomic prediction methods fail to consider environmental factors and the real growth environments of crops, resulting in low genomic prediction accuracy. In this work, we developed GEFormer, a genotype-environment interaction genomic prediction method that integrates gating multilayer perceptron (gMLP) and linear attention mechanisms. First, GEFormer uses gMLP to extract local and global features among SNPs. Then, Omni-dimensional Dynamic Convolution is used to extract the dynamic and comprehensive features of multiple environmental factors within each day, taking into consideration the real growth pattern of crops. A linear attention mechanism is used to capture the temporal features of environmental changes. Finally, GEFormer uses a gating mechanism to effectively fuse the genomic and environmental features. We examined the accuracy of GEFormer for predicting important agronomic traits of maize, rice, and wheat under three experimental scenarios: untested genotypes in tested environments, tested genotypes in untested environments, and untested genotypes in untested environments. The results showed that GEFormer outperforms six cutting-edge statistical learning methods and four machine learning methods, especially with great advantages under the scenario of untested genotypes in untested environments. In addition, we used GEFormer for three realtion using an inbred population, and cross-population phenotype prediction. The results showed that GEFormer had better prediction performance in actual breeding scenarios and could be used to assist in crop breeding.
引用
收藏
页码:527 / 549
页数:23
相关论文
共 2 条
  • [1] EMPIRICAL METHOD OF GROUPING GENOTYPES BASED ON A LINEAR FUNCTION OF GENOTYPE-ENVIRONMENT INTERACTION
    LIN, CS
    THOMPSON, B
    HEREDITY, 1975, 34 (APR) : 255 - 263
  • [2] A study on genotype-environment interaction based on GGE biplot graphical method in sunflower genotypes (Helianthus annuus L.)
    Ansarifard, Isa
    Mostafavi, Khodadad
    Khosroshahli, Mahmood
    Reza Bihamta, Mohammad
    Ramshini, Hosein
    FOOD SCIENCE & NUTRITION, 2020, 8 (07): : 3327 - 3334