Machine learning models identify gene predictors of waggle dance behaviour in honeybees

被引:8
|
作者
Veiner, Marcell [1 ]
Morimoto, Juliano [2 ]
Leadbeater, Ellouise [3 ]
Manfredini, Fabio [2 ,3 ]
机构
[1] Univ Aberdeen, Sch Nat & Comp Sci, Aberdeen, Scotland
[2] Univ Aberdeen, Sch Biol Sci, Aberdeen, Scotland
[3] Royal Holloway Univ London, Sch Biol Sci, Egham, Surrey, England
基金
英国自然环境研究理事会; 欧盟地平线“2020”; 欧洲研究理事会;
关键词
bioinfomatics; feature selection; genomics; gene structure and function; insects; social evolution; SOCIAL-BEHAVIOR; MUSHROOM BODIES; SELECTION; NAVIGATION; EXPRESSION; EVOLUTION; PROTEIN; GENOME; FLIGHT;
D O I
10.1111/1755-0998.13611
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The molecular characterization of complex behaviours is a challenging task as a range of different factors are often involved to produce the observed phenotype. An established approach is to look at the overall levels of expression of brain genes-or 'neurogenomics'-to select the best candidates that associate with patterns of interest. However, traditional neurogenomic analyses have some well-known limitations: above all, the usually limited number of biological replicates compared to the number of genes tested-known as the "curse of dimensionality." In this study we implemented a machine learning (ML) approach that can be used as a complement to more established methods of transcriptomic analyses. We tested three supervised learning algorithms (Random Forests, Lasso and Elastic net Regularized Generalized Linear Model, and Support Vector Machine) for their performance in the characterization of transcriptomic patterns and identification of genes associated with honeybee waggle dance. We then matched the results of these analyses with traditional outputs of differential gene expression analyses and identified two promising candidates for the neural regulation of the waggle dance: boss and hnRNP A1. Overall, our study demonstrates the application of ML to analyse transcriptomics data and identify candidate genes underlying social behaviour. This approach has great potential for application to a wide range of different scenarios in evolutionary ecology, when investigating the genomic basis for complex phenotypic traits, and can present some clear advantages compared to the established tools of gene expression analysis, making it a valuable complement for future studies.
引用
收藏
页码:2248 / 2261
页数:14
相关论文
共 50 条
  • [1] Machine learning and phylogenetic models identify predictors of genetic variation in Neotropical amphibians
    Amador, Luis
    Arroyo-Torres, Irvin
    Barrow, Lisa N.
    JOURNAL OF BIOGEOGRAPHY, 2024, 51 (05) : 909 - 923
  • [2] Machine learning reveals the waggle drift's role in the honey bee dance communication system
    Dormagen, David M.
    Wild, Benjamin
    Wario, Fernando
    Landgraf, Tim
    PNAS NEXUS, 2023, 2 (09):
  • [3] Machine Learning Methods to Identify Predictors of Psychological Distress
    Chen, Yang
    Zhang, Xiaomei
    Lu, Lin
    Wang, Yinzhi
    Liu, Jiajia
    Qin, Lei
    Ye, Linglong
    Zhu, Jianping
    Shia, Ben-Chang
    Chen, Ming-Chih
    PROCESSES, 2022, 10 (05)
  • [5] Applying Machine Learning Models to Identify Forest Cover
    Johnson, Peter
    Abdelfattah, Eman
    2018 9TH IEEE ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2018, : 471 - 474
  • [6] Different learning predictors and their effects for Moodle Machine Learning models
    Bognar, Laszlo
    Fauszt, Tibor
    2020 11TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM 2020), 2020, : 405 - 409
  • [7] A combined strategy of feature selection and machine learning to identify predictors of prediabetes
    De Silva, Kushan
    Jonsson, Daniel
    Demmer, Ryan T.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (03) : 396 - 406
  • [8] Combining machine learning with Cox models to identify predictors for incident post-menopausal breast cancer in the UK Biobank
    Liu, Xiaonan
    Morelli, Davide
    Littlejohns, Thomas J.
    Clifton, David A.
    Clifton, Lei
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [9] Combining machine learning with Cox models to identify predictors for incident post-menopausal breast cancer in the UK Biobank
    Xiaonan Liu
    Davide Morelli
    Thomas J. Littlejohns
    David A. Clifton
    Lei Clifton
    Scientific Reports, 13
  • [10] Machine Learning Models Identify Predictors of Poor Outcome in Patients Undergoing Mechanical Thrombectomy for Acute Ischemic Stroke.
    Teitcher, Michael
    Kubicki, Konrad
    Saini, Pravesh
    Sharba, Lana
    Singh, Jasmine
    Taylor, William
    Syed, Anam
    de la Pena, Paula
    Thomas, Shon
    Serrone, Joseph
    Garg, Ravi
    Gill, Rick
    STROKE, 2021, 52