A uniform projection method for motif discovery in DNA sequences

Cited by: 19
Authors
Raphael, B [1]
Liu, LT [1]
Varghese, G [1]
Affiliations
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
Keywords
motif discovery; transcription factor binding sites; random projection; combinatorial designs; low-discrepancy sequences
DOI
10.1109/TCBB.2004.14
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Buhler and Tompa [5] introduced the random projection algorithm for the motif discovery problem and demonstrated that this algorithm performs well on both simulated and biological samples. We describe a modification of the random projection algorithm, called the uniform projection algorithm, which utilizes a different choice of projections. We replace the random selection of projections by a greedy heuristic that approximately equalizes the coverage of the projections. We show that this change in selection of projections leads to improved performance on motif discovery problems. Furthermore, the uniform projection algorithm is directly applicable to other problems where the random projection algorithm has been used, including comparison of protein sequence databases.
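The abstract does not spell out the greedy heuristic, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes that a projection is a set of k positions chosen from a motif of length l, and that "coverage" can be approximated by how often each position, and each pair of positions, appears across the chosen projections. The function name `greedy_uniform_projections` and its parameters are hypothetical.

```python
from itertools import combinations


def greedy_uniform_projections(l, k, m):
    """Pick m projections (sorted k-subsets of range(l)) with roughly even coverage.

    Assumption: coverage is measured by position counts and pair counts.
    This is an illustrative stand-in, not the heuristic from the paper.
    """
    pos_count = [0] * l  # how many projections use each position
    pair_count = {pair: 0 for pair in combinations(range(l), 2)}
    projections = []

    for _ in range(m):
        chosen = []
        for _ in range(k):
            candidates = [p for p in range(l) if p not in chosen]

            def cost(p):
                # Prefer positions that have co-occurred least with those
                # already chosen, then positions used least overall.
                pair_cost = sum(pair_count[tuple(sorted((p, q)))] for q in chosen)
                return (pair_cost, pos_count[p], p)

            chosen.append(min(candidates, key=cost))

        chosen.sort()
        for p in chosen:
            pos_count[p] += 1
        for pair in combinations(chosen, 2):
            pair_count[pair] += 1
        projections.append(tuple(chosen))

    return projections, pos_count


projections, pos_count = greedy_uniform_projections(l=6, k=3, m=4)
print(projections)
print(pos_count)
```

In the random projection setting, the k positions would instead be sampled uniformly at random for every projection; the sketch replaces that sampling step with a deterministic choice that keeps the usage counts close to each other.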
Pages: 91-94
Page count: 4
Related papers
50 in total
  • [21] A greedy two-stage Gibbs sampling method for motif discovery in biological sequences
    Liu Li-fang
    Jiao Li-cheng
    Huo Hong-wei
    BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 13 - +
  • [22] RPPMD (Randomly Projected Possible Motif Discovery): An Efficient Bucketing Method for Finding DNA Planted Motif
    Bin Ashraf, Faisal
    Abir, Ali Imam
    Salekin, Md Sirajus
    Mottalib, M. A.
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION ENGINEERING (ECCE), 2017, : 509 - 513
  • [23] Motif discovery in upstream sequences of coordinately expressed genes
    Stine, M
    Dasgupta, D
    Mukatira, S
    CEC: 2003 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-4, PROCEEDINGS, 2003, : 1596 - 1603
  • [24] PROBABILISTIC ANALYSIS OF A MOTIF DISCOVERY ALGORITHM FOR MULTIPLE SEQUENCES
    Fu, Bin
    Kao, Ming-Yang
    Wang, Lusheng
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 2009, 23 (04) : 1715 - 1737
  • [25] Sublinear Time Motif Discovery from Multiple Sequences
    Fu, Bin
    Fu, Yunhui
    Xue, Yuan
    ALGORITHMS, 2013, 6 (04) : 636 - 677
  • [26] An Algorithmic Framework for Motif Discovery Problems in Weighted Sequences
    Zhang, Hui
    Guo, Qing
    Iliopoulos, Costas S.
    ALGORITHMS AND COMPLEXITY, PROCEEDINGS, 2010, 6078 : 335 - +
  • [27] An Entropy-Based Position Projection Algorithm for Motif Discovery
    Zhang, Yipu
    Wang, Ping
    Yan, Maode
    BIOMED RESEARCH INTERNATIONAL, 2016, 2016
  • [28] Gene discovery in DNA sequences
    Institute for Genomic Research
    Unknown
    Unknown
    Unknown
    IEEE Intell Syst their Appl, 6 (44-48):
  • [29] Gene discovery in DNA sequences
    Salzberg, SL
    IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1999, 14 (06) : 44 - 48
  • [30] Fast structured motif search in DNA sequences
    Halachev, Mihail
    Shiri, Nematollaah
    BIOINFORMATICS RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2008, 13 : 58 - 73