An Evolutionary Algorithm for Feature Selective Double Clustering of Text Documents

被引:0
|
作者
Nourashrafeddin, S. N. [1 ]
Milios, Evangelos [1 ]
Arnold, Dirk V. [1 ]
机构
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada
关键词
Genetic algorithm; co-clustering; multiobjective optimization; text clustering; INFORMATION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We propose FSDC, an evolutionary algorithm for Feature Selective Double Clustering of text documents. We first cluster the terms existing in the document corpus. The term clusters are then fed into multiobjective genetic algorithms to prune non- informative terms and form sets of keyterms representing topics. Based on the topic keyterms found, representative documents for each topic are extracted. These documents are then used as seeds to cluster all documents in the dataset. FSDC is compared to some well- known co- clusterers on real text datasets. The experimental results show that our algorithm can outperform the competitors.
引用
收藏
页码:446 / 453
页数:8
相关论文
共 50 条
  • [41] Unsupervised Feature Selection Technique Based on Harmony Search Algorithm for Improving the Text Clustering
    Abualigah, Laith Mohammad
    Khader, Ahamad Tajudin
    Al-Betar, Mohammed Azmi
    2016 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2016,
  • [42] Automatic Text Documents Summarization through Sentences Clustering
    Alguliev, R. M.
    Alyguliev, R. M.
    JOURNAL OF AUTOMATION AND INFORMATION SCIENCES, 2008, 40 (09) : 53 - 63
  • [43] A Social Spider Optimization Approach for Clustering Text Documents
    Chandran, T. Ravi
    Reddy, A. V.
    Janet, B.
    PROCEEDINGS OF THE 2016 IEEE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL & ELECTRONICS, INFORMATION, COMMUNICATION & BIO INFORMATICS (IEEE AEEICB-2016), 2016, : 22 - 26
  • [44] Using the self organizing map for clustering of text documents
    Isa, Dino
    Kallimani, V. P.
    Lee, Lam Hong
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (05) : 9584 - 9591
  • [45] Text clustering to help knowledge acquisition from documents
    Lapalut, S
    ADVANCES IN KNOWLEDGE ACQUISITION, 1996, 1076 : 115 - 130
  • [46] Short documents clustering in very large text databases
    Wang, Yongheng
    Jia, Yan
    Yang, Shuqiang
    WEB INFORMATION SYSTEMS - WISE 2006 WORKSHOPS, PROCEEDINGS, 2006, 4256 : 83 - 93
  • [47] Genetic Algorithms Revisited for Improved Clustering of Text Documents
    Abdullah-Al-Wadud, M.
    2018 INTERNATIONAL CONFERENCE ON INNOVATION IN ENGINEERING AND TECHNOLOGY (ICIET), 2018,
  • [48] Simultaneous clustering and dynamic keyword weighting for text documents
    Frigui, H
    Nasraoui, F
    SURVEY OF TEXT MINING: CLUSTERING, CLASSIFICATION, AND RETRIEVAL, 2004, : 45 - 72
  • [49] Contextual adaptive clustering of Web and text documents with personalization
    Ciesielski, Krzysztof
    Klopotek, Mieczyslaw A.
    Wierzchon, Slawomir T.
    MINING COMPLEX DATA, 2008, 4944 : 116 - 130
  • [50] A probabilistic model for clustering text documents with multiple fields
    Zhu, Shanfeng
    Takigawa, Ichigaku
    Zhang, Shuqin
    Mamitsuka, Hiroshi
    ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 331 - +