A new fast technique for pattern matching in biological sequences

被引:0
|
作者
Osman Ali Sadek Ibrahim
Belal A. Hamed
Tarek Abd El-Hafeez
机构
[1] Minia University,Department of Computer Science, Faculty of Science
[2] Deraya University,Computer Science Unit
来源
关键词
Bioinformatics; Character comparison; Pattern matching; String Matching; DNA Sequences;
D O I
暂无
中图分类号
学科分类号
摘要
At numerous phases of the computational process, pattern matching is essential. It enables users to search for specific DNA subsequences or DNA sequences in a database. In addition, some of these rapidly expanding biological databases are updated on a regular basis. Pattern searches can be improved by using high-speed pattern matching algorithms. Researchers are striving to improve solutions in numerous areas of computational bioinformatics as biological data grows exponentially. Faster algorithms with a low error rate are needed in real-world applications. As a result, this study offers two pattern matching algorithms that were created to help speed up DNA sequence pattern searches. The strategies recommended improve performance by utilizing word-level processing rather than character-level processing, which has been used in previous research studies. In terms of time cost, the proposed algorithms (EFLPM and EPAPM) increased performance by leveraging word-level processing with large pattern size. The experimental results show that the proposed methods are faster than other algorithms for short and long patterns. As a result, the EFLPM algorithm is 54% faster than the FLPM method, while the EPAPM algorithm is 39% faster than the PAPM method.
引用
收藏
页码:367 / 388
页数:21
相关论文
共 50 条
  • [31] Compressed Pattern Matching in Dna Sequences
    Kanchana, N.
    Sarala, S.
    PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 9 (ICCSIT 2010), 2010, : 157 - 160
  • [32] Pattern Matching in Polyphonic Musical Sequences
    Alzamel, Mai
    Iliopoulos, Costas
    Lim, Zara
    Moser-Schwaiger, Marie
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT IV, AIAI 2024, 2024, 714 : 139 - 151
  • [33] Fast matching method for DNA sequences
    Kim, Jin Wook
    Kim, Eunsang
    Park, Kunsoo
    COMBINATORICS, ALGORITHMS, PROBABILISTIC AND EXPERIMENTAL METHODOLOGIES, 2007, 4614 : 271 - +
  • [34] Fast string matching for DNA sequences
    Ryu, Cheol
    Lecroq, Thierry
    Park, Kunsoo
    THEORETICAL COMPUTER SCIENCE, 2020, 812 (137-148) : 137 - 148
  • [35] A new fast block matching algorithm using the simplex technique
    Rehan, M
    Antoniou, A
    Agathoklis, P
    1998 IEEE SYMPOSIUM ON ADVANCES IN DIGITAL FILTERING AND SIGNAL PROCESSING, 1998, : 30 - 33
  • [36] AN IMPROVED ALGORITHM FOR MATCHING BIOLOGICAL SEQUENCES
    GOTOH, O
    JOURNAL OF MOLECULAR BIOLOGY, 1982, 162 (03) : 705 - 708
  • [37] Fast practical online exact single and multiple pattern matching algorithms in highly similar sequences
    Ben Nsira, Nadia
    Lecroq, Thierry
    Prieur-Gaston, Elise
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2019, 22 (01) : 1 - 18
  • [38] Fast Indexes for Gapped Pattern Matching
    Caceres, Manuel
    Puglisi, Simon J.
    Zhukova, Bella
    SOFSEM 2020: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2020, 12011 : 493 - 504
  • [39] Hardware Architecture for the Fast Pattern Matching
    Kastil, Jan
    Kosar, Vlastimil
    Korenek, Jan
    PROCEEDINGS OF THE 2013 IEEE 16TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2013, : 120 - 123
  • [40] Fast pattern matching in indexed texts
    Senellart, J
    THEORETICAL COMPUTER SCIENCE, 2000, 237 (1-2) : 239 - 262