A new cluster merging algorithm of Suffix Tree Clustering

被引:0
|
作者
Wang, Jianhua [1 ]
Li, Ruixu [1 ]
机构
[1] Yantai Univ, Dept Comp Sci, Shandong, Peoples R China
来源
关键词
Suffix Tree Clustering; cluster merging algorithm;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document clustering methods can be used to structure large sets of text or hypertext documents. Suffix Tree Clustering has been proved to be a good approach for documents clustering. However. the cluster merging algorithm of Suffix Tree Clustering is based on the over lap of their document sets, which totally ignore the similarity between the non-overlap parts of different Clusters. In this paper, we introduce a novel cluster merging approach which will combines the cosine similarity and overlap percentage. Using this method, we can get a better clustering result and a comparative small number of clusters.
引用
收藏
页码:197 / +
页数:2
相关论文
共 50 条
  • [1] Search Results Clustering Algorithm based on the Suffix Tree
    Wang, Dengwei
    Liu, Libo
    Dong, Jing
    Zheng, Jiao
    2015 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING ICISCE 2015, 2015, : 456 - 460
  • [2] WordNet-Based Suffix Tree Clustering Algorithm
    Dang, Qiuyue
    Zhang, Jiwei
    Lu, Yueming
    Zhang, Kuo
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND COMPUTER APPLICATIONS (ICSA 2013), 2013, 92 : 66 - 74
  • [3] Improving Suffix Tree Clustering Algorithm for Web Documents
    Zhuang, Yan
    Chen, Youguang
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE (LEMCS 2015), 2015, 117 : 1557 - 1561
  • [4] A Chinese Web Page Clustering Algorithm Based on the Suffix Tree
    YANG Jian-wu National Key Laboratory for Text Processing
    WuhanUniversityJournalofNaturalSciences, 2004, (05) : 817 - 822
  • [5] A Novel Clustering Algorithm Based on Gravity and Cluster Merging
    Zhong, Jiang
    Liu, Longhai
    Li, Zhiguo
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 302 - 309
  • [6] Applying Semantic Suffix Net to Suffix Tree Clustering
    Janruang, Jongkol
    Guha, Sumanta
    2011 3RD CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2011, : 146 - 152
  • [7] Clustering Web Search Results Based on Interactive Suffix Tree Algorithm
    Wang, Ying
    Zuo, Wanli
    Peng, Tao
    He, Fengling
    Hu, Hailong
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 2, PROCEEDINGS, 2008, : 851 - 857
  • [8] Improving Suffix Tree Clustering with New Ranking and Similarity Measures
    Worawitphinyo, Phiradit
    Gao, Xiaoying
    Jabeen, Shahida
    ADVANCED DATA MINING AND APPLICATIONS, PT II, 2011, 7121 : 55 - 68
  • [9] Gene sequences clustering and identifying functional domain using a suffix tree algorithm
    Han, Sang Il
    Lee, Sung Gun
    Hwang, Kyu Suk
    Kim, Young Han
    2006 SICE-ICASE INTERNATIONAL JOINT CONFERENCE, VOLS 1-13, 2006, : 2315 - +
  • [10] CLAGen: A tool for clustering and annotating gene sequences using a suffix tree algorithm
    Han, Sang il
    Lee, Sung Gun
    Kim, Kyung-Hoon
    Choi, Chung Jung
    Kim, Young Han
    Hwang, Kyu Suk
    BIOSYSTEMS, 2006, 84 (03) : 175 - 182