SoftVoting6mA: An improved ensemble-based method for predicting DNA N6-methyladenine sites in cross-species genomes

Cited by: 1
Authors
Yin Z. [1]
Lyu J. [1]
Zhang G. [1]
Huang X. [1]
Ma Q. [2, 3]
Jiang J. [1]
Affiliations
[1] College of Information Science and Engineering, Shaoyang University, Shaoyang
[2] College of Information Science and Engineering, Hohai University, Nanjing
[3] Faculty of Information Technology, University of Jyvaskyla, Jyvaskyla
Keywords
convolution neural network; cross-species; DNA N6-methyladenine; feature fusion; soft voting; webserver
DOI
10.3934/mbe.2024169
Abstract
DNA N6-methyladenine (6mA) is an epigenetic modification that plays a pivotal role in biological processes including gene expression, DNA replication, repair, and recombination. Precise identification of 6mA sites is therefore fundamental to understanding its function, but it remains challenging. We propose SoftVoting6mA, an improved ensemble-based method for predicting DNA N6-methyladenine sites in cross-species genomes. After comparing the performance of 15 types of encoding, SoftVoting6mA selects four of them (electron-ion interaction pseudopotential, one-hot encoding, Kmer, and pseudo dinucleotide composition) to represent DNA sequences. Similarly, SoftVoting6mA combines four learning algorithms using the soft voting strategy. Both 5-fold cross-validation and independent tests showed that SoftVoting6mA reached state-of-the-art performance. To enhance accessibility, a user-friendly web server is provided at http://www.biolscience.cn/SoftVoting6mA/. © 2024 the Author(s).
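The abstract names the building blocks (sequence encodings and soft voting) without spelling them out. The following is a minimal sketch of two of the four encodings (one-hot and Kmer) and of soft voting as it is commonly defined, namely averaging the positive-class probabilities of several models. The function names, the equal weights, the 0.5 threshold, and the example probabilities are all assumptions made for illustration. The abstract does not specify the paper's four learners or its feature-fusion details, so none of this should be read as the authors' implementation.

```python
from itertools import product

BASES = "ACGT"


def one_hot(seq):
    """Flatten a DNA sequence into a one-hot vector, four values per base."""
    return [1.0 if base == b else 0.0 for base in seq for b in BASES]


def kmer_frequencies(seq, k=2):
    """Return normalized counts of all 4**k k-mers in lexicographic order."""
    if len(seq) < k:
        raise ValueError("sequence must be at least k bases long")
    kmers = ["".join(p) for p in product(BASES, repeat=k)]
    counts = {kmer: 0 for kmer in kmers}
    total = len(seq) - k + 1  # number of sliding windows
    for i in range(total):
        window = seq[i:i + k]
        if window in counts:  # skip windows with ambiguous bases such as N
            counts[window] += 1
    return [counts[kmer] / total for kmer in kmers]


def soft_vote(probabilities, weights=None, threshold=0.5):
    """Average positive-class probabilities from several models.

    Equal weights and a 0.5 threshold are illustrative assumptions.
    Returns the (weighted) mean probability and the resulting 0/1 label.
    """
    if weights is None:
        weights = [1.0] * len(probabilities)
    mean = sum(w * p for w, p in zip(weights, probabilities)) / sum(weights)
    return mean, int(mean >= threshold)


# Hypothetical usage: fuse two encodings of one short sequence, then
# combine four made-up model outputs for a candidate 6mA site.
features = one_hot("GATC") + kmer_frequencies("GATC", k=2)
score, label = soft_vote([0.9, 0.6, 0.4, 0.7])
```

Unlike hard (majority) voting, soft voting lets a confident model outweigh several uncertain ones, because the probabilities themselves are averaged rather than the predicted labels.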
Pages: 3798-3815
Page count: 17
Related papers
40 in total
  • [21] Ense-i6mA: Identification of DNA N6-Methyladenine Sites Using XGB-RFE Feature Selection and Ensemble Machine Learning
    Fan, Xueqiang
    Lin, Bing
    Hu, Jun
    Guo, Zhongyi
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (06) : 1842 - 1854
  • [22] EpiSemble: A Novel Ensemble-based Machine-learning Framework for Prediction of DNA N6-methyladenine Sites Using Hybrid Features Selection Approach for Crops
    Sinha, Dipro
    Dasmandal, Tanwy
    Yeasin, Md
    Mishra, Dwijesh C.
    Rai, Anil
    Archak, Sunil
    CURRENT BIOINFORMATICS, 2023, 18 (07) : 587 - 597
  • [23] GC6mA-Pred: A deep learning approach to identify DNA N6-methyladenine sites in the rice genome
    Cai, Jianhua
    Xiao, Guobao
    Su, Ran
    METHODS, 2022, 204 : 14 - 21
  • [24] SNN6mA: Improved DNA N6-methyladenine site prediction using Siamese network-based feature embedding
    Yu, Xuan
    Hu, Jun
    Zhang, Ying
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 166
  • [25] StackRAM: a cross-species method for identifying RNA N6-methyladenosine sites based on stacked ensemble
    Zhang, Yaqun
    Yu, Zhaomin
    Yu, Bin
    Wang, Xue
    Gao, Hongli
    Sun, Jianqiang
    Li, Shuangyi
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2022, 222
  • [26] iDNA6mA-Rice-DL: A local web server for identifying DNA N6-methyladenine sites in rice genome by deep learning method
    He, Shiqian
    Kong, Liang
    Chen, Jing
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2021, 19 (05)
  • [27] BERT6mA: prediction of DNA N6-methyladenine site using deep learning-based approaches
    Tsukiyama, Sho
    Hasan, Md Mehedi
    Deng, Hong-Wen
    Kurata, Hiroyuki
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [28] 6mAPred-MSFF: A Deep Learning Model for Predicting DNA N6-Methyladenine Sites across Species Based on a Multi-Scale Feature Fusion Mechanism
    Zeng, Rao
    Liao, Minghong
    APPLIED SCIENCES-BASEL, 2021, 11 (16)
  • [29] i6mA-DNCP: Computational Identification of DNA N6-Methyladenine Sites in the Rice Genome Using Optimized Dinucleotide-Based Features
    Kong, Liang
    Zhang, Lichao
    GENES, 2019, 10 (10)
  • [30] Meta-i6mA: an interspecies predictor for identifying DNA N6-methyladenine sites of plant genomes by exploiting informative features in an integrative machine-learning framework
    Hasan, Md Mehedi
    Basith, Shaherin
    Khatun, Mst Shamima
    Lee, Gwang
    Manavalan, Balachandran
    Kurata, Hiroyuki
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)