A uniform projection method for motif discovery in DNA sequences

Cited by: 19
Authors
Raphael, B [1]
Liu, LT [1]
Varghese, G [1]
Affiliations
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
Keywords
motif discovery; transcription factor binding sites; random projection; combinatorial designs; low-discrepancy sequences
DOI
10.1109/TCBB.2004.14
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Buhler and Tompa [5] introduced the random projection algorithm for the motif discovery problem and demonstrated that this algorithm performs well on both simulated and biological samples. We describe a modification of the random projection algorithm, called the uniform projection algorithm, which utilizes a different choice of projections. We replace the random selection of projections by a greedy heuristic that approximately equalizes the coverage of the projections. We show that this change in selection of projections leads to improved performance on motif discovery problems. Furthermore, the uniform projection algorithm is directly applicable to other problems where the random projection algorithm has been used, including comparison of protein sequence databases.
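The abstract does not spell out the greedy heuristic, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes that "coverage" means how often each of the `l` motif positions appears across the chosen projections, and it equalizes that count by always picking the `k` least-covered positions. The function names `greedy_projections` and `bucket_lmers`, and the parameters `l`, `k`, and `m`, are hypothetical and introduced here for illustration.

```python
import random
from collections import defaultdict


def greedy_projections(l, k, m, seed=0):
    """Pick m projections (k positions out of l) with near-equal position coverage.

    Assumption: coverage is counted per single position. The published
    heuristic may balance a different quantity.
    """
    rng = random.Random(seed)
    coverage = [0] * l
    projections = []
    for _ in range(m):
        # Order positions by how often they were used so far; break ties randomly.
        order = sorted(range(l), key=lambda p: (coverage[p], rng.random()))
        chosen = tuple(sorted(order[:k]))
        for p in chosen:
            coverage[p] += 1
        projections.append(chosen)
    return projections, coverage


def bucket_lmers(sequences, l, projection):
    """Hash every l-mer into a bucket keyed by its letters at the projected positions."""
    buckets = defaultdict(list)
    for seq_index, seq in enumerate(sequences):
        for start in range(len(seq) - l + 1):
            key = "".join(seq[start + p] for p in projection)
            buckets[key].append((seq_index, start))
    return buckets


if __name__ == "__main__":
    projections, coverage = greedy_projections(l=10, k=4, m=5)
    print(projections)
    print(coverage)
    print(dict(bucket_lmers(["ACGTACGT"], l=4, projection=(0, 2))))
```

As in the random projection approach, buckets that collect unusually many l-mers would then be candidates for refinement; that refinement step is outside the scope of this sketch.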
Pages: 91-94
Number of pages: 4