Combining Multiple K-Means Clusterings of Chemical Structures Using Cluster-Based Similarity Partitioning Algorithm

被引:0
|
作者
Saeedi, Faisal [1 ,2 ]
Salim, Naomie [1 ]
Abdo, Ammar [3 ]
Hentabli, Hamza [1 ]
机构
[1] Univ Teknol Malaysia, Fac Comp Sci & Informat Syst, Johor Baharu, Johor, Malaysia
[2] Dept Informat Technol, Sanhan Community Coll, Sanaa, Yemen
[3] Hodeidah Univ, Dept Comp Sci, Hodeidah, Yemen
来源
ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS | 2012年 / 322卷
关键词
2D Fingerprint; Compound Selection; Consensus Clustering; K-Means; Molecular Datasets; Ward's Method; DATA FUSION; COMBINATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Consensus clustering methods have been used in many areas to improve the quality of individual clusterings. In this paper, graph-based consensus clustering, Cluster-based Similarity Partitioning Algorithm (CSPA), was used to improve the quality of chemical structures clustering by enhancing the ability to separate active from inactive molecules in each cluster and improve the robustness and stability of individual clusterings. The clustering was evaluated using Quality Partition Index (QPI) measure and the results were compared with the Ward's clustering method. The chemical dataset MDL Drug Data Report (MDDR) database was used for experiments. The results obtained by combining multiple K-means clusterings showed that graph-based consensus clustering, CSPA, can improve the quality of individual chemical structure clusterings.
引用
收藏
页码:304 / +
页数:3
相关论文
共 50 条
  • [21] GWO optimized k-means cluster based Oversampling Algorithm
    Subbulaxmi, Santha S.
    Arumugam, G.
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2021, 12 (03): : 343 - 355
  • [22] The Application on Intrusion Detection Based on K-means Cluster Algorithm
    Meng Jianliang
    Shang Haikun
    Bian Ling
    2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, : 150 - 152
  • [23] Flow cluster algorithm based on improved K-means method
    Dong, Shi
    Zhou, Dingding
    Ding, Wei
    Gong, Jian
    IETE JOURNAL OF RESEARCH, 2013, 59 (04) : 326 - 333
  • [24] Information Theory and Voting Based Consensus Clustering for Combining Multiple Clusterings of Chemical Structures
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    MOLECULAR INFORMATICS, 2013, 32 (07) : 591 - 598
  • [25] A modified version of the K-means algorithm based on the shape similarity distance
    Li, Dan
    Li, Xinbao
    FRONTIERS OF MECHANICAL ENGINEERING AND MATERIALS ENGINEERING II, PTS 1 AND 2, 2014, 457-458 : 1064 - 1068
  • [26] Similarity matrix-based K-means algorithm for text clustering
    曹奇敏
    郭巧
    吴向华
    JournalofBeijingInstituteofTechnology, 2015, 24 (04) : 566 - 572
  • [27] Running section similarity matching based on improved K-means algorithm
    Liang H.
    Tian C.
    Wang T.
    Cao X.
    Yang X.
    Liu Y.
    Dianli Zidonghua Shebei/Electric Power Automation Equipment, 2019, 39 (07): : 119 - 124and140
  • [28] Cluster Analysis Using Firefly-Based K-means Algorithm: A Combined Approach
    Nayak, Janmenjoy
    Naik, Bighnaraj
    Behera, H. S.
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, CIDM 2016, 2017, 556 : 55 - 64
  • [29] KCLP: A K-MEANS CLUSTER-BASED LOCATION PRIVACY PROTECTION SCHEME IN WSNS FOR IOT
    Han, Guangjie
    Wang, Hao
    Guizani, Mohsen
    Chan, Sammy
    Zhang, Wenbo
    IEEE WIRELESS COMMUNICATIONS, 2018, 25 (06) : 84 - 90
  • [30] Comparing performance of feedforward neural nets and K-means for cluster-based market segmentation
    Hruschka, H
    Natter, M
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1999, 114 (02) : 346 - 353