Document Clustering Using Incremental and Pairwise Approaches

Authors
Tran, Tien [1]
Nayak, Richi [1]
Bruza, Peter [1]
Affiliations
[1] Queensland Univ Technol, Brisbane, Qld 4001, Australia
Source
Keywords
Clustering; structure; content; XML; INEX 2007
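DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract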
This paper presents the experiments and results of a clustering approach applied to the large Wikipedia dataset in the INEX 2007 Document Mining Challenge. The approach combines an incremental clustering method with a pairwise clustering method. The incremental method first reduces the dataset to a set of clusters whose number is not fixed in advance, which makes the clustering task feasible on a large dataset. The pairwise method then clusters this reduced dataset into the required number of clusters. In this way, the large number of documents is clustered successfully and an accurate clustering solution is obtained.
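The two-stage idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the similarity measure, the assignment rule, or the merge criterion, so everything below is an assumption made for illustration: documents are plain numeric vectors, similarity is cosine, the incremental stage is a single-pass threshold rule, and the pairwise stage merges the two clusters with the most similar centroids. The function names and the threshold value are hypothetical and are not taken from the paper.

```python
from math import sqrt


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def incremental_pass(docs, threshold):
    """Stage 1 (assumed rule): one pass over the documents.

    Each document joins the most similar existing cluster if the
    similarity reaches `threshold`; otherwise it starts a new cluster.
    The number of clusters is therefore not fixed in advance.
    """
    clusters = []   # each cluster is a list of document indices
    centroids = []  # centroids[i] summarises clusters[i]
    for index, doc in enumerate(docs):
        best, best_sim = None, threshold
        for c, cen in enumerate(centroids):
            sim = cosine(doc, cen)
            if sim >= best_sim:
                best, best_sim = c, sim
        if best is None:
            clusters.append([index])
            centroids.append(list(doc))
        else:
            clusters[best].append(index)
            centroids[best] = centroid([docs[j] for j in clusters[best]])
    return clusters


def pairwise_merge(docs, clusters, k):
    """Stage 2 (assumed rule): merge the most similar pair until k remain."""
    clusters = [list(c) for c in clusters]
    while len(clusters) > k:
        cens = [centroid([docs[j] for j in c]) for c in clusters]
        best_pair, best_sim = None, float("-inf")
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                sim = cosine(cens[a], cens[b])
                if sim > best_sim:
                    best_pair, best_sim = (a, b), sim
        a, b = best_pair
        clusters[a].extend(clusters[b])
        del clusters[b]
    return clusters


# Toy data standing in for document feature vectors (hypothetical).
docs = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9], [0.8, 0.6]]
reduced = incremental_pass(docs, threshold=0.95)
final = pairwise_merge(docs, reduced, k=2)
print(reduced, final)
```

The point of the split is cost: the pairwise stage compares every pair of clusters, so running it on the reduced set from stage 1 rather than on every document keeps the quadratic step small.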
Pages: 222-233
Page count: 12