Clustering sequence data using hidden Markov model representation

被引:10
|
作者
Li, C [1 ]
Biswas, G [1 ]
机构
[1] Vanderbilt Univ, Dept Comp Sci, Nashville, TN 37235 USA
关键词
clustering; hidden Markov model; model selection; Bayesian Information Criterion(BIC); mutual information;
D O I
10.1117/12.339979
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposed a clustering methodology for sequence data using hidden Markov model(HMM) representation. The proposed methodology improves upon existing HMM based clustering methods in two ways: (i) it enables HMMs to dynamically change its model structure to obtain a better fit model for data during clustering process, and (ii) it provides objective criterion function to select the optimal clustering partition. The algorithm is presented in terms of four nested levels of searches: (i) the search. for the optimal number of clusters in a partition, (ii) the search for the optimal structure for a given partition, (iii) the search for the optimal HMM structure for each cluster, and (iv) the search for the optimal HMM parameters for each HMM. Preliminary results are given to support the proposed methodology.
引用
收藏
页码:14 / 21
页数:4
相关论文
共 50 条
  • [1] Auroral Sequence Representation and Classification Using Hidden Markov Models
    Yang, Qiuju
    Liang, Jimin
    Hu, Zejun
    Zhao, Heng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2012, 50 (12): : 5049 - 5060
  • [2] Hidden Markov Model Optimized by PSO Algorithm for Gene Sequence Clustering
    Soruri, Mohammad
    Sadri, Javad
    Zahiri, S. Hamid
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, DATA AND CLOUD COMPUTING (ICC 2017), 2017,
  • [3] Sequence Clustering with the Self-Organizing Hidden Markov Model Map
    Ferles, Christos
    Stafylopatis, Andreas
    8TH IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING, VOLS 1 AND 2, 2008, : 430 - 436
  • [4] Multiple sequence alignments using hidden Markov Model
    Ergezer, H
    Leblebicioglu, G
    PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 434 - 437
  • [5] Gene Sequence Clustering Based on the Profile Hidden Markov Model with Differential Identifiability
    Ren, Xujie
    Shang, Tao
    Jiang, Yatong
    Liu, Jianwei
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021 (2021)
  • [6] A new text clustering method using hidden Markov model
    Fu, Yan
    Yang, Dongqing
    Tang, Shiwei
    Wang, Tengjiao
    Gao, Aiqiang
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2007, 4592 : 73 - +
  • [7] Hidden Markov Model Representation Using Probabilistic Neural Network
    Hewahi, Nabil M.
    BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2018, 9 (03): : 50 - 62
  • [8] Attack Sequence Detection in Cloud Using Hidden Markov Model
    Chen, Chia-Mei
    Guan, D. J.
    Huang, Yu-Zhi
    Ou, Ya-Hui
    PROCEEDINGS OF THE 2012 SEVENTH ASIA JOINT CONFERENCE ON INFORMATION SECURITY (ASIAJCIS 2012), 2012, : 100 - 103
  • [9] HMMGEP: clustering gene expression data using hidden Markov models
    Ji, XL
    Yuan, Y
    Li, YD
    Sun, ZR
    BIOINFORMATICS, 2004, 20 (11) : 1799 - 1800
  • [10] A hidden Markov model for rainfall using breakpoint data
    Sansom, J
    JOURNAL OF CLIMATE, 1998, 11 (01) : 42 - 53