Gaussian Mixture Model Clustering with Incomplete Data

被引:34
|
作者
Zhang, Yi [1 ]
Li, Miaomiao [1 ,2 ]
Wang, Siwei [1 ]
Dai, Sisi [1 ]
Luo, Lei [1 ]
Zhu, En [1 ]
Xu, Huiying [3 ,4 ]
Zhu, Xinzhong [3 ]
Yao, Chaoyun [5 ]
Zhou, Haoran [6 ]
机构
[1] NUDT, Sch Comp, Changsha, Peoples R China
[2] Changsha Univ, Changsha, Hunan, Peoples R China
[3] Zhejiang Normal Univ, Coll Math & Comp Sci, Hangzhou, Zhejiang, Peoples R China
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[5] NUDT, Lab Complex Electromagnet Environm Effects Elect, Changsha, Peoples R China
[6] Chongqing Univ Technol, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
GMM; clustering; EM; incomplete data;
D O I
10.1145/3408318
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gaussian mixturemodel (GMM) clustering has been extensively studied due to its effectiveness and efficiency. Though demonstrating promising performance in various applications, it cannot effectively address the absent features among data, which is not uncommon in practical applications. In this article, different from existing approaches that first impute the absence and then perform GMM clustering tasks on the imputed data, we propose to integrate the imputation and GMM clustering into a unified learning procedure. Specifically, the missing data is filled by the result of GMM clustering, and the imputed data is then taken for GMM clustering. These two steps alternatively negotiate with each other to achieve optimum. By this way, the imputed data can best serve for GMM clustering. A two-step alternative algorithm with proved convergence is carefully designed to solve the resultant optimization problem. Extensive experiments have been conducted on eight UCI benchmark datasets, and the results have validated the effectiveness of the proposed algorithm.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Structural α-Entropy Weighting Gaussian Mixture Model for Subspace Clustering
    Li K.
    Zhang K.-X.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (03): : 718 - 725
  • [32] Unsupervised rough clustering method based on Gaussian mixture model
    Dept. of Computer Science and Technology, Xi'an Jiaotong University, Xi'an 710049, China
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2006, 38 (02): : 256 - 259
  • [33] Spatiotemporal clustering using Gaussian processes embedded in a mixture model
    Vanhatalo, Jarno
    Foster, Scott D.
    Hosack, Geoffrey R.
    ENVIRONMETRICS, 2021, 32 (07)
  • [34] Shared Gaussian Process Latent Variable Model for Incomplete Multiview Clustering
    Li, Ping
    Chen, Songcan
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (01) : 61 - 73
  • [35] Vine copula mixture models and clustering for non-Gaussian data
    Sahin, Ozge
    Czado, Claudia
    ECONOMETRICS AND STATISTICS, 2022, 22 : 136 - 158
  • [36] Adapted Expectation Maximization Algorithm for Gaussian Mixture Clustering With Censored Data
    Yu H.-Y.
    Chen J.-J.
    Qiu H.
    Wang Y.
    Wang R.-F.
    Zidonghua Xuebao/Acta Automatica Sinica, 2021, 47 (06): : 1302 - 1314
  • [37] Power Consumption Prediction via Improved Gaussian Mixture Clustering Model for Automatically Clustering
    Chen, Buhua
    Liu, Hanjiang
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 1969 - 1973
  • [38] MIXTURE MODEL CLUSTERING OF BINNED UNCERTAIN DATA
    Hamdan, Hani
    CONTROL ENGINEERING AND APPLIED INFORMATICS, 2012, 14 (01): : 67 - 73
  • [39] A mixture model approach for binned data clustering
    Samé, A
    Ambroise, C
    Govaert, G
    ADVANCES IN INTELLIGENT DATA ANALYSIS V, 2003, 2810 : 265 - 274
  • [40] A Bayesian mixture model for clustering circular data
    Rodriguez, Carlos E.
    Nunez-Antonio, Gabriel
    Escarela, Gabriel
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 143