DNA promoter task-oriented dictionary mining and prediction model based on natural language technology

Cited by: 0
Authors
Zeng, Ruolei [1]
Li, Zihan [2]
Li, Jialu [2]
Zhang, Qingchuan [2]
Affiliations
[1] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
[2] Beijing Technol & Business Univ, Natl Engn Res Ctr Agriprod Qual Traceabil, 11 Fucheng Rd, Beijing 100048, Peoples R China
Source
SCIENTIFIC REPORTS | 2025 / Vol. 15 / Issue 01
Funding
Ministry of Science and Technology of China, "Eleventh Five-Year Plan" Science and Technology Program;
Keywords
NEURAL-NETWORK;
DOI
10.1038/s41598-024-84105-9
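Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code
07; 0710; 09;
Abstract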
Promoters are essential DNA sequences that initiate transcription and regulate gene expression. Precisely identifying promoter sites is crucial for deciphering gene expression patterns and the roles of gene regulatory networks. Recent advancements in bioinformatics have leveraged deep learning and natural language processing (NLP) to enhance promoter prediction accuracy. Techniques such as convolutional neural networks (CNNs), long short-term memory (LSTM) networks, and BERT models have been particularly impactful. However, current approaches often rely on arbitrary DNA sequence segmentation during BERT pre-training, which may not yield optimal results. To overcome this limitation, this article introduces a novel DNA sequence segmentation method. This approach develops a more refined dictionary for DNA sequences, utilizes it for BERT pre-training, and employs an Inception neural network as the foundational model. This BERT-Inception architecture captures information across multiple granularities. Experimental results show that the model improves the performance of several downstream tasks and introduces deep learning interpretability, providing new perspectives for interpreting and understanding DNA sequence information. The detailed source code is available at https://github.com/katouMegumiH/Promoter_BERT.
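The contrast the abstract draws between arbitrary segmentation and a mined dictionary can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not specify how the dictionary is mined or applied, so the greedy longest-match rule, the function names, and the toy vocabulary below are all assumptions made for illustration only.

```python
def kmer_tokens(seq, k=3):
    """Overlapping fixed-length k-mers: a common 'arbitrary' segmentation."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def dictionary_tokens(seq, vocab, max_len=6):
    """Greedy longest-match segmentation against a dictionary (assumed rule).

    At each position, take the longest substring (up to max_len) found in
    vocab; if none matches, fall back to a single nucleotide.
    """
    tokens, i = [], 0
    while i < len(seq):
        for n in range(min(max_len, len(seq) - i), 0, -1):
            piece = seq[i:i + n]
            if n == 1 or piece in vocab:
                tokens.append(piece)
                i += n
                break
    return tokens


# Hypothetical dictionary; a real one would be mined from promoter data.
toy_vocab = {"TTGACA", "TATAAT", "GC"}
sequence = "TTGACAGCTATAATG"

fixed = kmer_tokens(sequence, k=3)
mined = dictionary_tokens(sequence, toy_vocab)
print(fixed)
print(mined)
```

With the toy vocabulary, the dictionary-based tokenizer keeps the motif-like entries intact as single tokens, whereas the fixed k-mer tokenizer splits them across several overlapping pieces. Either token sequence could then be fed to a BERT-style model for pre-training.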
Pages: 11
Related papers
50 items in total
  • [11] Task-Based Learning via Task-Oriented Prediction Network with Applications in Finance
    Chen, Di
    Zhu, Yada
    Cui, Xiaodong
    Gomes, Carla P.
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4476 - 4482
  • [12] ToAM: a task-oriented authentication model for UAVs based on blockchain
    Aiguo Chen
    Kun Peng
    Zexin Sha
    Xincen Zhou
    Zhen Yang
    Guoming Lu
    EURASIP Journal on Wireless Communications and Networking, 2021
  • [13] ToAM: a task-oriented authentication model for UAVs based on blockchain
    Chen, Aiguo
    Peng, Kun
    Sha, Zexin
    Zhou, Xincen
    Yang, Zhen
    Lu, Guoming
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2021, 2021 (01)
  • [14] Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems
    Razumovskaia, Evgeniia
    Glavas, Goran
    Majewska, Olga
    Ponti, Edoardo M.
    Korhonen, Anna
    Vulic, Ivan
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 1351 - 1402
  • [15] TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue
    Wu, Chien-Sheng
    Hoi, Steven
    Socher, Richard
    Xiong, Caiming
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 917 - 929
  • [16] Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems
    Razumovskaia E.
    Glavaš G.
    Majewska O.
    Ponti E.M.
    Korhonen A.
    Vulic I.
    Journal of Artificial Intelligence Research, 2022, 74 : 1351 - 1402
  • [17] Personality-aware Natural Language Generation for Task-oriented Dialogue using Reinforcement Learning
    Guo, Ao
    Ohashi, Atsumoto
    Chiba, Yuya
    Tsunomori, Yuiko
    Hirai, Ryu
    Higashinaka, Ryuichiro
    2023 32ND IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, RO-MAN, 2023, : 1823 - 1828
  • [18] An integrated discourse recipe-based model for task-oriented dialogue
    Green, N
    Lehman, JF
    DISCOURSE PROCESSES, 2002, 33 (02) : 133 - 158
  • [19] GraspGPT: Leveraging Semantic Knowledge From a Large Language Model for Task-Oriented Grasping
    Tang, Chao
    Huang, Dehao
    Ge, Wenqi
    Liu, Weiyu
    Zhang, Hong
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (11) : 7551 - 7558
  • [20] Multi-domain Language Understanding of Task-Oriented Dialogue Based on Intent Enhancement
    Yu, Feng
    Zheng, Dequan
    Zhao, Xiaotian
    2020 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2020), 2020, : 221 - 228