A new fast technique for pattern matching in biological sequences

被引:0
|
作者
Osman Ali Sadek Ibrahim
Belal A. Hamed
Tarek Abd El-Hafeez
机构
[1] Minia University,Department of Computer Science, Faculty of Science
[2] Deraya University,Computer Science Unit
来源
关键词
Bioinformatics; Character comparison; Pattern matching; String Matching; DNA Sequences;
D O I
暂无
中图分类号
学科分类号
摘要
At numerous phases of the computational process, pattern matching is essential. It enables users to search for specific DNA subsequences or DNA sequences in a database. In addition, some of these rapidly expanding biological databases are updated on a regular basis. Pattern searches can be improved by using high-speed pattern matching algorithms. Researchers are striving to improve solutions in numerous areas of computational bioinformatics as biological data grows exponentially. Faster algorithms with a low error rate are needed in real-world applications. As a result, this study offers two pattern matching algorithms that were created to help speed up DNA sequence pattern searches. The strategies recommended improve performance by utilizing word-level processing rather than character-level processing, which has been used in previous research studies. In terms of time cost, the proposed algorithms (EFLPM and EPAPM) increased performance by leveraging word-level processing with large pattern size. The experimental results show that the proposed methods are faster than other algorithms for short and long patterns. As a result, the EFLPM algorithm is 54% faster than the FLPM method, while the EPAPM algorithm is 39% faster than the PAPM method.
引用
收藏
页码:367 / 388
页数:21
相关论文
共 50 条
  • [21] Quantum-based exact pattern matching algorithms for biological sequences
    Soni, Kapil Kumar
    Rasool, Akhtar
    ETRI JOURNAL, 2021, 43 (03) : 483 - 510
  • [22] Pattern matching in constrained sequences
    Choi, Yongwook
    Szpankowski, Wojciech
    2007 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS, VOLS 1-7, 2007, : 2606 - 2610
  • [23] Fast bitwise pattern-matching algorithm for DNA sequences on modern hardware
    Ozcan, Giyasettin
    Unsal, Osman Sabri
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2015, 23 (05) : 1405 - 1417
  • [24] PERFORMANCE STUDY OF PARALLEL HYBRID MULTIPLE PATTERN MATCHING ALGORITHMS FOR BIOLOGICAL SEQUENCES
    Kouzinopoulos, Charalampos S.
    Michailidis, Panagiotis D.
    Margaritis, Konstantinos G.
    BIOINFORMATICS: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOINFORMATICS MODELS, METHODS AND ALGORITHMS, 2012, : 182 - 187
  • [25] Pattern Matching with Variables: Fast Algorithms and New Hardness Results
    Fernau, Henning
    Manea, Florin
    Mercas, Robert
    Schmid, Markus L.
    32ND INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2015), 2015, 30 : 302 - 315
  • [26] Practical fast on-line exact pattern matching algorithms for highly similar sequences
    Ben Nsira, Nadia
    Lecroq, Thierry
    Prieur-Gaston, Elise
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 1772 - 1777
  • [27] A FAST pattern matching algorithm
    Sheik, SS
    Aggarwal, SK
    Poddar, A
    Balakrishnan, N
    Sekar, K
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (04): : 1251 - 1256
  • [28] Fast graph pattern matching
    Cheng, Jiefeng
    Yu, Jeffrey Xu
    Ding, Bolin
    Yu, Philip S.
    Wang, Haixun
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 913 - +
  • [29] A fast Boyer-Moore type pattern matching algorithm for highly similar sequences
    Ben Nsira, Nadia
    Lecroq, Thierry
    Elloumi, Mourad
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 13 (03) : 266 - 288
  • [30] Efficient pattern matching of multidimensional sequences
    Lee, S
    Oh, K
    Kwon, D
    Choi, W
    Hong, J
    Choi, J
    Lee, D
    ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, PT 2, PROCEEDINGS, 2005, 3642 : 202 - 210