DNA promoter task-oriented dictionary mining and prediction model based on natural language technology

Citations: 0
Authors
Zeng, Ruolei [1]
Li, Zihan [2]
Li, Jialu [2]
Zhang, Qingchuan [2]
Affiliations
[1] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
[2] Beijing Technol & Business Univ, Natl Engn Res Ctr Agriprod Qual Traceabil, 11 Fucheng Rd, Beijing 100048, Peoples R China
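Source
SCIENTIFIC REPORTS | 2025 / Volume 15 / Issue 01
Funding
Ministry of Science and Technology of China, "Eleventh Five-Year Plan" Science and Technology Program;
Keywords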
NEURAL-NETWORK;
DOI
10.1038/s41598-024-84105-9
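Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and Earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract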
Promoters are essential DNA sequences that initiate transcription and regulate gene expression. Precisely identifying promoter sites is crucial for deciphering gene expression patterns and the roles of gene regulatory networks. Recent advancements in bioinformatics have leveraged deep learning and natural language processing (NLP) to enhance promoter prediction accuracy. Techniques such as convolutional neural networks (CNNs), long short-term memory (LSTM) networks, and BERT models have been particularly impactful. However, current approaches often rely on arbitrary DNA sequence segmentation during BERT pre-training, which may not yield optimal results. To overcome this limitation, this article introduces a novel DNA sequence segmentation method. This approach develops a more refined dictionary for DNA sequences, utilizes it for BERT pre-training, and employs an Inception neural network as the foundational model. This BERT-Inception architecture captures information across multiple granularities. Experimental results show that the model improves the performance of several downstream tasks and introduces deep learning interpretability, providing new perspectives for interpreting and understanding DNA sequence information. The detailed source code is available at https://github.com/katouMegumiH/Promoter_BERT.
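The contrast the abstract draws between arbitrary segmentation and a mined dictionary can be illustrated with a minimal sketch. The abstract does not say how the dictionary is mined or applied, so everything below is an assumption made for illustration: the function names `kmer_tokens` and `dictionary_tokens`, the greedy longest-match rule, and the toy vocabulary (which borrows the well-known E. coli -35 and -10 consensus hexamers, TTGACA and TATAAT) are hypothetical and are not taken from the paper or its repository.

```python
from typing import List, Set


def kmer_tokens(seq: str, k: int = 3) -> List[str]:
    """Fixed-length overlapping k-mers: the 'arbitrary' baseline segmentation."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def dictionary_tokens(seq: str, vocab: Set[str]) -> List[str]:
    """Greedy longest-match segmentation against a vocabulary.

    Assumption for illustration only: at each position the longest
    vocabulary entry that matches is taken; if none matches, a single
    nucleotide is emitted so the sequence is always fully covered.
    """
    max_len = max((len(word) for word in vocab), default=1)
    tokens: List[str] = []
    i = 0
    while i < len(seq):
        # Try the longest candidate first, shrinking down to one base.
        for length in range(min(max_len, len(seq) - i), 0, -1):
            piece = seq[i:i + length]
            if length == 1 or piece in vocab:
                tokens.append(piece)
                i += length
                break
    return tokens


# Hypothetical toy dictionary; a real one would be mined from data.
TOY_VOCAB = {"TTGACA", "TATAAT", "GC"}

if __name__ == "__main__":
    sequence = "TTGACAGCTATAAT"
    print(kmer_tokens(sequence, k=3))
    print(dictionary_tokens(sequence, TOY_VOCAB))
```

With fixed 3-mers, the two motifs are split across many overlapping tokens, whereas the dictionary-based pass keeps each motif as one token. Tokens of this second kind are what a BERT-style model would be pre-trained on under the assumption made here.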
Pages: 11