An intelligent computational model for prediction of promoters and their strength via natural language processing

被引：15

作者：

Tahir, Muhammad ^{[1
,2
]}

Hayat, Maqsood ^{[1
]}

Gul, Sarah ^{[4
]}

Chong, Kil To ^{[2
,3
]}

机构：

[1] Abdul Wali Khan Univ, Dept Comp Sci, Mardan 23200, KP, Pakistan

[2] Chonbuk Natl Univ, Dept Elect & Informat Engn, Jeonju 54896, South Korea

[3] Chonbuk Natl Univ, Adv Elect & Informat Res Ctr, Jeonju 54896, South Korea

[4] Int Islamic Univ, Dept Biol Sci, FBAS, Islamabad, Pakistan

来源：

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS | 2020年 / 202卷

基金：

新加坡国家研究基金会;

关键词：

Promoters; Convolution neural network (CNN); Natural language processing; DNA; word2vec; SEQUENCE-BASED PREDICTOR; RECOMBINATION SPOTS; ENSEMBLE CLASSIFIER; PROTEIN TYPES; IDENTIFICATION; SITES; FEATURES; SPACE; DISCRIMINATION; TRINUCLEOTIDE;

D O I：

10.1016/j.chemolab.2020.104034

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In DNA, a promoter is an essential part of genes that controls the transcription of specific genes in a particular tissue or cells. The combination of RNA polymerase and a number of various proteins named "sigma-factors" can define the transcription start site (TSS) by inducing RNA holoenzyme. Further, Promoter is categorized into strong and weak promoters on the basis of promoter strength. Owing to exponential increase of RNA/DNA and protein samples in the post-genomic era, developing a simple and efficient sequential-based intelligent computational model for the discrimination of promoters is a challenging job. An intelligent computational model namely: 2L-iPSW(word2vec) was introduced for discrimination of promoters and their strength, in this regard. Machine learning and Deep learning algorithms in conjunction with natural language processing method i.e., "word2vec" are used. The proposed computational model 2L-iPSW(word2vec) achieved 91.42% of accuracy for 1st layer contains promoters and non-promoters which is 8.29% higher than the existing model, whereas 82.42% of accuracy for 2nd layer identifies strong promoter and weak promoter which is 11.22% advanced than the present model. Proposed 2L-iPSW(word2vec) model obtained efficient success rates than the present models in terms of all assessment metrics. It is thus greatly observed that the 2L-iPSW(word2vec) model will lead a useful tool for academic research on promoter identification.

引用

页数：7

共 50 条

[31] Natural language processing in an intelligent writing strategy tutoring system
Danielle S. McNamara
Scott A. Crossley
Rod Roscoe
Behavior Research Methods, 2013, 45 : 499 - 515
[32] INTEGRATION OF NATURAL-LANGUAGE AND VISION PROCESSING - INTELLIGENT MULTIMEDIA
MCKEVITT, P
ARTIFICIAL INTELLIGENCE REVIEW, 1995, 9 (2-3) : 77 - 80
[33] Natural language processing in an intelligent writing strategy tutoring system
McNamara, Danielle S.
Crossley, Scott A.
Roscoe, Rod
BEHAVIOR RESEARCH METHODS, 2013, 45 (02) : 499 - 515
[34] Intelligent SPARQL Query Generation for Natural Language Processing Systems
Chen, Yi-Hui
Lu, Eric Jui-Lin
Ou, Ting-An
IEEE ACCESS, 2021, 9 : 158638 - 158650
[35] Research Summary: Intelligent Natural Language Processing Techniques and Tools
Paolucci, Alessio
LOGIC PROGRAMMING, 2009, 5649 : 536 - 537
[36] Intelligent requirement-to-test-case traceability system via Natural Language Processing and Machine Learning
Sawada, Kae
Pomerantz, Marc
Razo, Gus
Clark, Michael W.
2023 IEEE 9TH INTERNATIONAL CONFERENCE ON SPACE MISSION CHALLENGES FOR INFORMATION TECHNOLOGY, SMC-IT, 2023, : 78 - 83
[37] 241Computational Politeness in Natural Language Processing: A Survey
Priya, Priyanshu
Firdaus, Mauajama
Ekbal, Asif
ACM COMPUTING SURVEYS, 2024, 56 (09)
[38] Scaling up Prediction of Psychosis by Natural Language Processing
Si, Dong
Cheng, Sunny Chieh
Xing, Ruiwen
Liu, Chang
Wu, Hoi Yan
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 339 - 347
[39] Preface to the Special Issue on Computational Linguistics and Natural Language Processing
Revesz, Peter Z.
INFORMATION, 2024, 15 (05)
[40] Natural Language Processing for EHR-Based Computational Phenotyping
Zeng, Zexian
Deng, Yu
Li, Xiaoyu
Naumann, Tristan
Luo, Yuan
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (01) : 139 - 153

← 1 2 3 4 5 →