Machine learning models identify gene predictors of waggle dance behaviour in honeybees

被引:8
|
作者
Veiner, Marcell [1 ]
Morimoto, Juliano [2 ]
Leadbeater, Ellouise [3 ]
Manfredini, Fabio [2 ,3 ]
机构
[1] Univ Aberdeen, Sch Nat & Comp Sci, Aberdeen, Scotland
[2] Univ Aberdeen, Sch Biol Sci, Aberdeen, Scotland
[3] Royal Holloway Univ London, Sch Biol Sci, Egham, Surrey, England
基金
英国自然环境研究理事会; 欧盟地平线“2020”; 欧洲研究理事会;
关键词
bioinfomatics; feature selection; genomics; gene structure and function; insects; social evolution; SOCIAL-BEHAVIOR; MUSHROOM BODIES; SELECTION; NAVIGATION; EXPRESSION; EVOLUTION; PROTEIN; GENOME; FLIGHT;
D O I
10.1111/1755-0998.13611
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The molecular characterization of complex behaviours is a challenging task as a range of different factors are often involved to produce the observed phenotype. An established approach is to look at the overall levels of expression of brain genes-or 'neurogenomics'-to select the best candidates that associate with patterns of interest. However, traditional neurogenomic analyses have some well-known limitations: above all, the usually limited number of biological replicates compared to the number of genes tested-known as the "curse of dimensionality." In this study we implemented a machine learning (ML) approach that can be used as a complement to more established methods of transcriptomic analyses. We tested three supervised learning algorithms (Random Forests, Lasso and Elastic net Regularized Generalized Linear Model, and Support Vector Machine) for their performance in the characterization of transcriptomic patterns and identification of genes associated with honeybee waggle dance. We then matched the results of these analyses with traditional outputs of differential gene expression analyses and identified two promising candidates for the neural regulation of the waggle dance: boss and hnRNP A1. Overall, our study demonstrates the application of ML to analyse transcriptomics data and identify candidate genes underlying social behaviour. This approach has great potential for application to a wide range of different scenarios in evolutionary ecology, when investigating the genomic basis for complex phenotypic traits, and can present some clear advantages compared to the established tools of gene expression analysis, making it a valuable complement for future studies.
引用
收藏
页码:2248 / 2261
页数:14
相关论文
共 50 条
  • [21] A Machine Learning Approach to Identify Predictors of Frequent Vaping and Vulnerable Californian Youth Subgroups
    Fu, Rui
    Shi, Jiamin
    Chaiton, Michael
    Leventhal, Adam M.
    Unger, Jennifer B.
    Barrington-Trimis, Jessica L.
    NICOTINE & TOBACCO RESEARCH, 2022, 24 (07) : 1028 - 1036
  • [22] Using Innovative Machine Learning Methods to Screen and Identify Predictors of Congenital Heart Diseases
    Qu, Yanji
    Deng, Xinlei
    Lin, Shao
    Han, Fengzhen
    Chang, Howard H.
    Ou, Yanqiu
    Nie, Zhiqiang
    Mai, Jinzhuang
    Wang, Ximeng
    Gao, Xiangmin
    Wu, Yong
    Chen, Jimei
    Zhuang, Jian
    Ryan, Ian
    Liu, Xiaoqing
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2022, 8
  • [23] Land subsidence hazard modeling: Machine learning to identify predictors and the role of human activities
    Rahmati, Omid
    Golkarian, Ali
    Biggs, Trent
    Keesstra, Saskia
    Mohammadi, Farnoush
    Daliakopoulos, Ioannis N.
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2019, 236 : 466 - 480
  • [24] Interpretable machine learning to identify important predictors of birth weight: A prospective cohort study
    Liu, Zheng
    Han, Na
    Su, Tao
    Ji, Yuelong
    Bao, Heling
    Zhou, Shuang
    Luo, Shusheng
    Wang, Hui
    Liu, Jue
    Wang, Hai-Jun
    FRONTIERS IN PEDIATRICS, 2022, 10
  • [25] Is Infidelity Predictable? Using Explainable Machine Learning to Identify the Most Important Predictors of Infidelity
    Vowels, Laura M.
    Vowels, Matthew J.
    Mark, Kristen P.
    JOURNAL OF SEX RESEARCH, 2022, 59 (02) : 224 - 237
  • [26] Using machine learning to identify top predictors for nurses' willingness to report medication errors
    Hu, Renjie
    Farag, Amany
    Bjork, Kaj-Mikael
    Lendasse, Amaury
    ARRAY, 2020, 8
  • [27] Machine Learning Models for Development of Gene and Cell Therapies
    Taylor, Albert A.
    Jahandideh, Samad
    Beaulieu, Danielle
    Keymer, Mike
    Ennist, David L.
    MOLECULAR THERAPY, 2017, 25 (05) : 77 - 77
  • [28] Evaluating Plant Gene Models Using Machine Learning
    Upadhyaya, Shriprabha R.
    Bayer, Philipp E.
    Tay Fernandez, Cassandria G.
    Petereit, Jakob
    Batley, Jacqueline
    Bennamoun, Mohammed
    Boussaid, Farid
    Edwards, David
    PLANTS-BASEL, 2022, 11 (12):
  • [29] Assessment of machine learning models as predictors of radioiodine thyroid ablation outcomes
    Vavrova, L.
    Tapprogge, J.
    Rushforth, D.
    Flux, G.
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2024, 51 : S780 - S780
  • [30] EVALUATION OF MACHINE LEARNING MODELS TO IDENTIFY PEACH VARIETIES BASED ON LEAF COLOR
    Ayala-Nino, Daniel
    Manuel Gonzalez-Camacho, Juan
    AGROCIENCIA, 2022, 56 (04) : 669 - 703