SoftVoting6mA: An improved ensemble-based method for predicting DNA N6-methyladenine sites in cross-species genomes

被引:1
|
作者
Yin Z. [1 ]
Lyu J. [1 ]
Zhang G. [1 ]
Huang X. [1 ]
Ma Q. [2 ,3 ]
Jiang J. [1 ]
机构
[1] College of Information Science and Engineering, Shaoyang University, Shaoyang
[2] College of Information Science and Engineering, Hohai University, Nanjing
[3] Faculty of Information Technology, University of Jyvaskyla, Jyvaskyla
关键词
convolution neural network; cross-species; DNA N6-methyladenine; feature fusion; soft voting; webserver;
D O I
10.3934/mbe.2024169
中图分类号
学科分类号
摘要
The DNA N6-methyladenine (6mA) is an epigenetic modification, which plays a pivotal role in biological processes encompassing gene expression, DNA replication, repair, and recombination. Therefore, the precise identification of 6mA sites is fundamental for better understanding its function, but challenging. We proposed an improved ensemble-based method for predicting DNA N6-methyladenine sites in cross-species genomes called SoftVoting6mA. The SoftVoting6mA selected four (electron–ion-interaction pseudo potential, One-hot encoding, Kmer, and pseudo dinucleotide composition) codes from 15 types of encoding to represent DNA sequences by comparing their performances. Similarly, the SoftVoting6mA combined four learning algorithms using the soft voting strategy. The 5-fold cross-validation and the independent tests showed that SoftVoting6mA reached the state-of-the-art performance. To enhance accessibility, a user-friendly web server is provided at http://www.biolscience.cn/SoftVoting6mA/. © 2024 the Author(s).
引用
收藏
页码:3798 / 3815
页数:17
相关论文
共 40 条
  • [1] i6mA-Vote: Cross-Species Identification of DNA N6-Methyladenine Sites in Plant Genomes Based on Ensemble Learning With Voting
    Teng, Zhixia
    Zhao, Zhengnan
    Li, Yanjuan
    Tian, Zhen
    Guo, Maozu
    Lu, Qianzi
    Wang, Guohua
    FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [2] Deep transformers and convolutional neural network in identifying DNA N6-methyladenine sites in cross-species genomes
    Le, Nguyen Quoc Khanh
    Ho, Quang-Thai
    METHODS, 2022, 204 : 199 - 206
  • [3] 6mA-Finder: a novel online tool for predicting DNA N6-methyladenine sites in genomes
    Xu, Haodong
    Hu, Ruifeng
    Jia, Peilin
    Zhao, Zhongming
    BIOINFORMATICS, 2020, 36 (10) : 3257 - 3259
  • [4] 6mA-StackingCV: an improved stacking ensemble model for predicting DNA N6-methyladenine site
    Guohua Huang
    Xiaohong Huang
    Wei Luo
    BioData Mining, 16
  • [5] i6mA-stack: A stacking ensemble-based computational prediction of DNA N6-methyladenine (6mA) sites in the Rosaceae genome
    Khanal, Jhabindra
    Lim, Dae Young
    Tayara, Hilal
    Chong, Kil To
    GENOMICS, 2021, 113 (01) : 582 - 592
  • [6] 6mA-StackingCV: an improved stacking ensemble model for predicting DNA N6-methyladenine site
    Huang, Guohua
    Huang, Xiaohong
    Luo, Wei
    BIODATA MINING, 2023, 16 (01)
  • [7] SpineNet-6mA: A Novel Deep Learning Tool for Predicting DNA N6-Methyladenine Sites in Genomes
    Abbas, Zeeshan
    Tayara, Hilal
    Chong, Kil To
    IEEE ACCESS, 2020, 8 : 201450 - 201457
  • [8] Plant6mA: A predictor for predicting N6-methyladenine sites with lightweight structure in plant genomes
    Shi, Hua
    Li, Shuang
    Su, Xi
    METHODS, 2022, 204 : 126 - 131
  • [9] SNNRice6mA: A Deep Learning Method for Predicting DNA N6-Methyladenine Sites in Rice Genome
    Yu, Haitao
    Dai, Zhiming
    FRONTIERS IN GENETICS, 2019, 10
  • [10] A Deep Learning Model for Predicting DNA N6-Methyladenine (6mA) Sites in Eukaryotes
    Roland, Lokuthota Hewage
    Wannige, Champi Thusangi
    IEEE ACCESS, 2020, 8 : 175535 - 175545