Machine learning models identify gene predictors of waggle dance behaviour in honeybees

被引:8
|
作者
Veiner, Marcell [1 ]
Morimoto, Juliano [2 ]
Leadbeater, Ellouise [3 ]
Manfredini, Fabio [2 ,3 ]
机构
[1] Univ Aberdeen, Sch Nat & Comp Sci, Aberdeen, Scotland
[2] Univ Aberdeen, Sch Biol Sci, Aberdeen, Scotland
[3] Royal Holloway Univ London, Sch Biol Sci, Egham, Surrey, England
基金
英国自然环境研究理事会; 欧盟地平线“2020”; 欧洲研究理事会;
关键词
bioinfomatics; feature selection; genomics; gene structure and function; insects; social evolution; SOCIAL-BEHAVIOR; MUSHROOM BODIES; SELECTION; NAVIGATION; EXPRESSION; EVOLUTION; PROTEIN; GENOME; FLIGHT;
D O I
10.1111/1755-0998.13611
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The molecular characterization of complex behaviours is a challenging task as a range of different factors are often involved to produce the observed phenotype. An established approach is to look at the overall levels of expression of brain genes-or 'neurogenomics'-to select the best candidates that associate with patterns of interest. However, traditional neurogenomic analyses have some well-known limitations: above all, the usually limited number of biological replicates compared to the number of genes tested-known as the "curse of dimensionality." In this study we implemented a machine learning (ML) approach that can be used as a complement to more established methods of transcriptomic analyses. We tested three supervised learning algorithms (Random Forests, Lasso and Elastic net Regularized Generalized Linear Model, and Support Vector Machine) for their performance in the characterization of transcriptomic patterns and identification of genes associated with honeybee waggle dance. We then matched the results of these analyses with traditional outputs of differential gene expression analyses and identified two promising candidates for the neural regulation of the waggle dance: boss and hnRNP A1. Overall, our study demonstrates the application of ML to analyse transcriptomics data and identify candidate genes underlying social behaviour. This approach has great potential for application to a wide range of different scenarios in evolutionary ecology, when investigating the genomic basis for complex phenotypic traits, and can present some clear advantages compared to the established tools of gene expression analysis, making it a valuable complement for future studies.
引用
收藏
页码:2248 / 2261
页数:14
相关论文
共 50 条
  • [31] Machine Learning Models Identify Gut Microbiota That Predict Chronicity in Reactive Arthritis
    Prakashini, M., V
    Mahapatra, Soumendu
    Murmu, Krushna Chandra
    Mishra, Rasmita
    Padhan, Prasanta
    Prasad, Punit
    Misra, Ramnath
    Ahmed, Sakir
    ARTHRITIS & RHEUMATOLOGY, 2023, 75 : 3540 - 3544
  • [32] Machine Learning Models Identify Inhibitors of New Delhi Metallo-β-lactamase
    Cheng, Zishuo
    Aitha, Mahesh
    Thomas, Caitlyn A.
    Sturgill, Aidan
    Fairweather, Mitch
    Hu, Amy
    Bethel, Christopher R.
    Rivera, Dann D.
    Dranchak, Patricia
    Thomas, Pei W.
    Li, Han
    Feng, Qi
    Tao, Kaicheng
    Song, Minshuai
    Sun, Na
    Wang, Shuo
    Silwal, Surendra Bikram
    Page, Richard C.
    Fast, Walt
    Bonomo, Robert A.
    Weese, Maria
    Martinez, Waldyn
    Inglese, James
    Crowder, Michael W.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (10) : 3977 - 3991
  • [33] Gene embedding: A novel machine learning approach to identify gene candidates related to immunotherapy responsiveness
    Choy, C. T.
    Wong, C. H.
    Chan, S. L.
    ANNALS OF ONCOLOGY, 2018, 29 : 22 - 22
  • [34] Validation of Machine Learning Models for Structural Dam Behaviour Interpretation and Prediction
    Mata, Juan
    Salazar, Fernando
    Barateiro, Jose
    Antunes, Antonio
    WATER, 2021, 13 (19)
  • [35] DeforestVis: Behaviour Analysis of Machine Learning Models with Surrogate Decision Stumps
    Chatzimparmpas, Angelos
    Martins, Rafeal M.
    Telea, Alexandru C.
    Kerren, Andreas
    COMPUTER GRAPHICS FORUM, 2024, 43 (06)
  • [36] Ensemble Machine Learning Models for Root Note Detection in Irish Instrumental Dance Music
    Shahid, Abdul
    Diamond, Danny
    McDermott, James
    d'Aquin, Mathieu
    2023 31ST IRISH CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COGNITIVE SCIENCE, AICS, 2023,
  • [37] Using machine learning to identify gene interaction networks associated with breast cancer
    Liyuan Liu
    Wenli Zhai
    Fei Wang
    Lixiang Yu
    Fei Zhou
    Yujuan Xiang
    Shuya Huang
    Chao Zheng
    Zhongshang Yuan
    Yong He
    Zhigang Yu
    Jiadong Ji
    BMC Cancer, 22
  • [38] Using machine learning to identify gene interaction networks associated with breast cancer
    Liu, Liyuan
    Zhai, Wenli
    Wang, Fei
    Yu, Lixiang
    Zhou, Fei
    Xiang, Yujuan
    Huang, Shuya
    Zheng, Chao
    Yuan, Zhongshang
    He, Yong
    Yu, Zhigang
    Ji, Jiadong
    BMC CANCER, 2022, 22 (01)
  • [39] Machine learning approach to identify early predictors of MS progression: the NeuroArtP3 project
    Poretto, Valentina
    Lapucci, Caterina
    Betti, Matteo
    Bellinvia, Angelo
    Endrizzi, Walter
    Ragni, Flavio
    Bovo, Stefano
    Longo, Chiara
    Carpi, Elisabetta
    Moroni, Monica
    Chierici, Marco
    Jurman, Giuseppe
    Osmani, Venet
    Piana, Michele
    Marenco, Manuela
    Marangoni, Sabrina
    Portaccio, Emilio
    Giometto, Bruno
    Inglese, Matilde
    Antonio, Ucccelli
    Amato, Maria Pia
    MULTIPLE SCLEROSIS JOURNAL, 2024, 30 (03) : 997 - 997
  • [40] Use of machine learning techniques to identify HIV predictors for screening in sub-Saharan Africa
    Charles K. Mutai
    Patrick E. McSharry
    Innocent Ngaruye
    Edouard Musabanganji
    BMC Medical Research Methodology, 21