Probability of large-scale data set EM clustering algorithms based on partial information constraints

被引:0
|
作者
Liu, Xiaoyan [1 ]
机构
[1] Changchun Univ Sci & Technol, Changchun 130600, Jilin Province, Peoples R China
关键词
Some constraint information; Clustering; The data set; The clustering quality; The probability of clustering algorithm;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The current situation, the need for clustering of data is very large, and the use of traditional algorithm for clustering process often tedious and time consuming is very long, the effect is not obvious. Based on this, this paper proposes a data sets EM probability based on some constraint information clustering algorithm, the detailed implementation process of the whole algorithm is described. Through experiment contrast scalable EM, positive_PC_SEM and full_PC_SEM clustering quality and efficiency of execution of the algorithm, the results show that the positive_PC_SEM algorithm and scalable EM algorithm compared to the clustering quality and efficiency is higher, although full_PC_SEM clustering quality is very high, but requires a lot of time.
引用
收藏
页码:1748 / 1751
页数:4
相关论文
共 50 条
  • [41] Large-Scale Data Clustering Algorithm Based on Quantum Immune Regulation Network
    Li, Yangyang
    Bai, Xiaoyu
    Hou, Xiaoju
    Jiao, Licheng
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017,
  • [42] Analysis of large-scale power quality monitoring data based on quantum clustering
    Zhong, Qing
    Liang, Jiahao
    Xu, Zhong
    Meyer, Jan
    Wang, Longjun
    Wang, Gang
    ELECTRIC POWER SYSTEMS RESEARCH, 2023, 220
  • [43] Reduce Redundancies: Signal-based Clustering of Large-scale Fingerprint Data
    Mueller, Mathias
    Schmalzbauer, Martin
    Meyer, Steffen
    Nicklas, Daniela
    2018 IEEE 29TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2018, : 842 - 848
  • [44] A large-scale crop protection bioassay data set
    Anna Gaulton
    Namrata Kale
    Gerard J. P. van Westen
    Louisa J. Bellis
    A. Patrícia Bento
    Mark Davies
    Anne Hersey
    George Papadatos
    Mark Forster
    Philip Wege
    John P. Overington
    Scientific Data, 2
  • [45] A large-scale crop protection bioassay data set
    Gaulton, Anna
    Kale, Namrata
    van Westen, Gerard J. P.
    Bellis, Louisa J.
    Bento, A. Patricia
    Davies, Mark
    Hersey, Anne
    Papadatos, George
    Forster, Mark
    Wege, Philip
    Overington, John P.
    SCIENTIFIC DATA, 2015, 2
  • [46] Distributed Entity Resolution Based on Similarity Join for Large-Scale Data Clustering
    Nie, Tiezheng
    Lee, Wang-chien
    Shen, Derong
    Yu, Ge
    Kou, Yue
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 138 - 149
  • [47] Large-Scale DataSet Cascading Clustering by Item Set and Space Decomposition
    Melnyk, Roman
    Tushnytskyy, Ruslan
    MEMSTECH: 2009 INTERNATIONAL CONFERENCE ON PERSPECTIVE TECHNOLOGIES AND METHODS IN MEMS DESIGN, 2009, : 81 - 83
  • [48] Inverted Index Construction Algorithms For Large-Scale Data
    Wang, He
    Chi, Chengying
    Zhang, Xiumei
    Zhan, Yunyun
    IAENG International Journal of Computer Science, 2022, 49 (04)
  • [49] A large-scale MAGDM model based on SKNN and weighted clustering under incomplete information
    Wu, Qianqian
    Tian, Donghong
    Lan, Ruike
    Li, Min
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [50] Parallel algorithms for clustering high-dimensional large-scale datasets
    Nagesh, H
    Goil, S
    Choudhary, A
    DATA MINING FOR SCIENTIFIC AND ENGINEERING APPLICATIONS, 2001, 2 : 335 - 356