CFOND: Consensus Factorization for Co-Clustering Networked Data

被引:32
|
作者
Guo, Ting [1 ]
Pan, Shirui [2 ]
Zhu, Xingquan [3 ]
Zhang, Chengqi [2 ]
机构
[1] CSIRO, Data61, Sydney, NSW 2015, Australia
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Ctr Artificial Intelligence, Sydney, NSW 2007, Australia
[3] Florida Atlantic Univ, Dept Comp & Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
关键词
Networked data; networks; co-clustering; topology; nonnegative matrix factorization; ALGORITHMS;
D O I
10.1109/TKDE.2018.2846555
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Networked data are common in domains where instances are characterized by both feature values and inter-dependency relationships. Finding cluster structures for networked instances and discovering representative features for each cluster represent a special co-clustering task usefully for many real-world applications, such as automatic categorization of scientific publications and finding representative key-words for each cluster. To date, although co-clustering has been commonly used for finding clusters for both instances and features, all existing methods are focused on instance-feature values, without leveraging valuable topology relationships between instances to help boost co-clustering performance. In this paper, we propose CFOND, a consensus factorization based framework for co-clustering networked data. We argue that feature values and linkages provide useful information from different perspectives, but they are not always consistent and therefore need to be carefully aligned for best clustering results. In the paper, we advocate a consensus factorization principle, which simultaneously factorizes information from three aspects: network topology structures, instance-feature content relationships, and feature-feature correlations. The consensus factorization ensures that the final cluster structures are consistent across information from the three aspects with minimum errors. Experiments on real-life networks validate the performance of our algorithm.
引用
收藏
页码:706 / 719
页数:14
相关论文
共 50 条
  • [21] Multiscale co-clustering for tensor data based on canonical polyadic decomposition and slice-wise factorization
    Wei, Zhenghong
    Zhao, Hongya
    Zhao, Lan
    Yan, Hong
    INFORMATION SCIENCES, 2019, 503 : 72 - 91
  • [22] Adaptive Spectral Co-clustering for Multiview Data
    Son, Jeong-Woo
    Jeon, Junekey
    Lee, Sang-Yun
    Kim, Sun-Joong
    2016 18TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATIONS TECHNOLOGY (ICACT) - INFORMATION AND COMMUNICATIONS FOR SAFE AND SECURE LIFE, 2016, : 447 - 450
  • [23] Co-clustering over multiple dynamic data streams based on non-negative matrix factorization
    Chun-Yan Sang
    Di-Hua Sun
    Applied Intelligence, 2014, 41 : 487 - 502
  • [24] Co-clustering over multiple dynamic data streams based on non-negative matrix factorization
    Sang, Chun-Yan
    Sun, Di-Hua
    APPLIED INTELLIGENCE, 2014, 41 (02) : 487 - 502
  • [25] Orthogonal Dual Graph-Regularized Nonnegative Matrix Factorization for Co-Clustering
    Tang, Jiayi
    Wan, Zhong
    JOURNAL OF SCIENTIFIC COMPUTING, 2021, 87 (03)
  • [26] Orthogonal Dual Graph-Regularized Nonnegative Matrix Factorization for Co-Clustering
    Jiayi Tang
    Zhong Wan
    Journal of Scientific Computing, 2021, 87
  • [27] Bayesian Co-clustering
    Shan, Hanhuai
    Banerjee, Arindam
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 530 - 539
  • [28] Tri-regularized nonnegative matrix tri-factorization for co-clustering
    Deng, Ping
    Li, Tianrui
    Wang, Hongjun
    Horng, Shi-Jinn
    Yu, Zeng
    Wang, Xiaomin
    KNOWLEDGE-BASED SYSTEMS, 2021, 226
  • [29] A Survey of Co-Clustering
    Wang, Hongjun
    Song, Yi
    Chen, Wei
    Luo, Zhipeng
    Li, Chongshou
    Li, Tianrui
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (09)
  • [30] Directional co-clustering
    Aghiles Salah
    Mohamed Nadif
    Advances in Data Analysis and Classification, 2019, 13 : 591 - 620