A new fast technique for pattern matching in biological sequences

被引:0
|
作者
Osman Ali Sadek Ibrahim
Belal A. Hamed
Tarek Abd El-Hafeez
机构
[1] Minia University,Department of Computer Science, Faculty of Science
[2] Deraya University,Computer Science Unit
来源
关键词
Bioinformatics; Character comparison; Pattern matching; String Matching; DNA Sequences;
D O I
暂无
中图分类号
学科分类号
摘要
At numerous phases of the computational process, pattern matching is essential. It enables users to search for specific DNA subsequences or DNA sequences in a database. In addition, some of these rapidly expanding biological databases are updated on a regular basis. Pattern searches can be improved by using high-speed pattern matching algorithms. Researchers are striving to improve solutions in numerous areas of computational bioinformatics as biological data grows exponentially. Faster algorithms with a low error rate are needed in real-world applications. As a result, this study offers two pattern matching algorithms that were created to help speed up DNA sequence pattern searches. The strategies recommended improve performance by utilizing word-level processing rather than character-level processing, which has been used in previous research studies. In terms of time cost, the proposed algorithms (EFLPM and EPAPM) increased performance by leveraging word-level processing with large pattern size. The experimental results show that the proposed methods are faster than other algorithms for short and long patterns. As a result, the EFLPM algorithm is 54% faster than the FLPM method, while the EPAPM algorithm is 39% faster than the PAPM method.
引用
收藏
页码:367 / 388
页数:21
相关论文
共 50 条
  • [41] Quantum Pattern Matching Fast on Average
    Ashley Montanaro
    Algorithmica, 2017, 77 : 16 - 39
  • [42] Fast and Simple Circular Pattern Matching
    Susik, Robert
    Grabowski, Szymon
    Deorowicz, Sebastian
    MAN-MACHINE INTERACTIONS 3, 2014, 242 : 537 - 544
  • [43] Quantum Pattern Matching Fast on Average
    Montanaro, Ashley
    ALGORITHMICA, 2017, 77 (01) : 16 - 39
  • [44] Fast exact pattern matching algorithm
    College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
    J. Comput. Inf. Syst., 2009, 1 (235-243):
  • [45] Pattern matching for arc-annotated sequences
    Gramm, J
    Guo, J
    Niedermeier, R
    FST TCS 2002: FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEOETICAL COMPUTER SCIENCE, PROCEEDINGS, 2002, 2556 : 182 - 193
  • [46] A Comparative Study of Pattern Matching Algorithms on Sequences
    Min, Fan
    Wu, Xindong
    ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, PROCEEDINGS, 2009, 5908 : 510 - +
  • [47] Pattern Matching for Arc-Annotated Sequences
    Gramm, Jens
    Guo, Jiong
    Niedermeier, Rolf
    ACM TRANSACTIONS ON ALGORITHMS, 2006, 2 (01) : 44 - 65
  • [48] RSMA Matching Algorithm for Searching Biological Sequences
    Klaib, Ahmad Fadel
    Osborne, Hugh
    2009 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY, 2009, : 190 - 194
  • [49] Fast motif selection for biological sequences
    Kuksa, Pavel
    Pavlovic, Vladimir
    2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 79 - 82
  • [50] A Fast Contiguous Sequential Pattern Mining Technique in DNA Data Sequences Using Position Information
    Zerin, Syeda Farzana
    Jeong, Byeong-Soo
    IETE TECHNICAL REVIEW, 2011, 28 (06) : 511 - 519