A novel incremental conceptual hierarchical text clustering method using CFu-tree

被引:13
|
作者
Peng, Tao [1 ,2 ,3 ]
Liu, Lu [1 ,2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[3] Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
关键词
Text clustering; CFu-tree; Comparison Variation (CV); Incremental hierarchical clustering; EFFICIENT ALGORITHM;
D O I
10.1016/j.asoc.2014.11.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a data mining method, clustering, which is one of the most important tools in information retrieval, organizes data based on unsupervised learning which means that it does not require any training data. But, some text clustering algorithms cannot update existing clusters incrementally and, instead, have to recompute a new clustering from scratch. In view of above, this paper presents a novel down-top incremental conceptual hierarchical text clustering approach using CFu-tree (ICHTC-CF) representation, which starts with each item as a separate cluster. Term-based feature extraction is used for summarizing a cluster in the process. The Comparison Variation measure criterion is also adopted for judging whether the closest pair of clusters can be merged or a previous cluster can be split. And, our incremental clustering method is not sensitive to the input data order. Experimental results show that the performance of our method outperforms k-means, CLIQUE, single linkage clustering and complete linkage clustering, which indicate our new technique is efficient and feasible. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:269 / 278
页数:10
相关论文
共 50 条
  • [41] A Novel Stable Clustering Design Method for Hierarchical Satellite Network
    Zhou Mu
    Guo Qing
    Wang Zhenyong
    CHINESE JOURNAL OF AERONAUTICS, 2010, 23 (01) : 91 - 102
  • [42] A novel method to evaluate clustering algorithms for hierarchical optical networks
    Shanguo Huang
    Weihua Lian
    Xian Zhang
    Bingli Guo
    Pei Luo
    Jie Zhang
    Wanyi Gu
    Photonic Network Communications, 2012, 23 : 183 - 190
  • [44] A novel method to evaluate clustering algorithms for hierarchical optical networks
    Huang, Shanguo
    Lian, Weihua
    Zhang, Xian
    Guo, Bingli
    Luo, Pei
    Zhang, Jie
    Gu, Wanyi
    PHOTONIC NETWORK COMMUNICATIONS, 2012, 23 (02) : 183 - 190
  • [45] Combining hierarchical clustering approaches using the PCA method
    Jafarzadegan, Mohammad
    Safi-Esfahani, Faramarz
    Beheshti, Zahra
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 137 : 1 - 10
  • [46] Dynamic Image Segmentation Method Using Hierarchical Clustering
    Galbiati, Jorge
    Allende, Hector
    Becerra, Carlos
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS, 2009, 5856 : 177 - +
  • [47] Load forecasting using hierarchical clustering method for building
    Hwang, Hye-Mi
    Lee, Sung-Hee
    Park, Jong-Bae
    Park, Yong-Gi
    Son, Sung-Yong
    Transactions of the Korean Institute of Electrical Engineers, 2015, 64 (01): : 41 - 47
  • [48] Hierarchical conceptual clustering based on quantile method for identifying microscopic details in distributional data
    Umbleja, Kadri
    Ichino, Manabu
    Yaguchi, Hiroyuki
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2021, 15 (02) : 407 - 436
  • [49] Hierarchical conceptual clustering based on quantile method for identifying microscopic details in distributional data
    Kadri Umbleja
    Manabu Ichino
    Hiroyuki Yaguchi
    Advances in Data Analysis and Classification, 2021, 15 : 407 - 436
  • [50] Secured Packet Inspection with Hierarchical Pattern Matching implemented using Incremental Clustering Algorithm
    Sethi, Purna Chandra
    Behera, Prafulla Kumar
    2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND APPLICATIONS (ICHPCA), 2014,