Machine Learning Applied to the Search for Nonlinear Features in Breeding Populations

被引:7
|
作者
Gabur, Iulian [1 ,2 ]
Simioniuc, Danut Petru [2 ]
Snowdon, Rod J. [1 ]
Cristea, Dan [3 ]
机构
[1] Justus Liebig Univ, Dept Plant Breeding, Giessen, Germany
[2] Iasi Univ Life Sci, Dept Plant Sci, Iasi, Romania
[3] Romanian Acad, Inst Comp Sci, Iasi Branch, Iasi, Romania
来源
关键词
machine learning; feature selection; linear models; genomic selection; wheat; oilseed rape; GENOME-ENABLED PREDICTION; REGRESSION-MODELS; SELECTION; PERFORMANCE; INFORMATION; SHRINKAGE;
D O I
10.3389/frai.2022.876578
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large plant breeding populations are traditionally a source of novel allelic diversity and are at the core of selection efforts for elite material. Finding rare diversity requires a deep understanding of biological interactions between the genetic makeup of one genotype and its environmental conditions. Most modern breeding programs still rely on linear regression models to solve this problem, generalizing the complex genotype by phenotype interactions through manually constructed linear features. However, the identification of positive alleles vs. background can be addressed using deep learning approaches that have the capacity to learn complex nonlinear functions for the inputs. Machine learning (ML) is an artificial intelligence (AI) approach involving a range of algorithms to learn from input data sets and predict outcomes in other related samples. This paper describes a variety of techniques that include supervised and unsupervised ML algorithms to improve our understanding of nonlinear interactions from plant breeding data sets. Feature selection (FS) methods are combined with linear and nonlinear predictors and compared to traditional prediction methods used in plant breeding. Recent advances in ML allowed the construction of complex models that have the capacity to better differentiate between positive alleles and the genetic background. Using real plant breeding program data, we show that ML methods have the ability to outperform current approaches, increase prediction accuracies, decrease the computing time drastically, and improve the detection of important alleles involved in qualitative or quantitative traits.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Image analysis of seeds and machine learning as a tool for distinguishing populations: Applied to an invasive tree species
    Felix, Francival Cardoso
    das Chagas, Kyvia Pontes Teixeira
    Araujo, Fernando dos Santos
    de Medeiros, Josenilda Aprigio Dantas
    Vieira, Fabio de Almeida
    Torres, Salvador Barros
    Pacheco, Mauro Vasconcelos
    ACTA SCIENTIARUM-AGRONOMY, 2023, 46
  • [22] Machine learning and deep learning applied in ultrasound
    Pehrson, Lea Marie
    Lauridsen, Carsten
    Nielsen, Michael Bachmann
    ULTRASCHALL IN DER MEDIZIN, 2018, 39 (04): : 379 - 381
  • [23] Hide and seek shark teeth in Random Forests: machine learning applied to Scyliorhinus canicula populations
    Berio, Fidji
    Bayle, Yann
    Baum, Daniel
    Goudemand, Nicolas
    Debiais-Thibaud, Melanie
    PEERJ, 2022, 10
  • [24] The Democratization of Machine Learning Features
    Patel, Jayesh
    2020 IEEE 21ST INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2020), 2020, : 136 - 141
  • [25] Unification of Machine Learning Features
    Patel, Jayesh
    2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 1201 - 1205
  • [26] Machine learning applied to property prediction of metal additive manufacturing products with textural features extraction
    Chang, Lien-Kai
    Chen, Ri-Sheng
    Tsai, Mi-Ching
    Lee, Rong-Mao
    Lin, Ching-Chih
    Huang, Jhih-Cheng
    Chang, Tsung-Wei
    Horng, Ming-Huwi
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2024, 132 (1-2): : 83 - 98
  • [27] Linear and nonlinear features and machine learning for wind turbine blade ice detection and diagnosis
    Arcos Jimenez, Alfredo
    Garcia Marquez, Fausto Pedro
    Borja Moraleda, Victoria
    Gomez Munoz, Carlos Quiterio
    RENEWABLE ENERGY, 2019, 132 : 1034 - 1048
  • [28] Machine learning applied to property prediction of metal additive manufacturing products with textural features extraction
    Lien-Kai Chang
    Ri-Sheng Chen
    Mi-Ching Tsai
    Rong-Mao Lee
    Ching-Chih Lin
    Jhih-Cheng Huang
    Tsung-Wei Chang
    Ming-Huwi Horng
    The International Journal of Advanced Manufacturing Technology, 2024, 132 : 83 - 98
  • [29] Machine Learning Approach for Prescriptive Plant Breeding
    Parmley, Kyle A.
    Higgins, Race H.
    Ganapathysubramanian, Baskar
    Sarkar, Soumik
    Singh, Asheesh K.
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [30] Machine learning in plant science and plant breeding
    van Dijk, Aalt Dirk Jan
    Kootstra, Gert
    Kruijer, Willem
    de Ridder, Dick
    ISCIENCE, 2021, 24 (01)