CFOND: Consensus Factorization for Co-Clustering Networked Data

被引:32
|
作者
Guo, Ting [1 ]
Pan, Shirui [2 ]
Zhu, Xingquan [3 ]
Zhang, Chengqi [2 ]
机构
[1] CSIRO, Data61, Sydney, NSW 2015, Australia
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Ctr Artificial Intelligence, Sydney, NSW 2007, Australia
[3] Florida Atlantic Univ, Dept Comp & Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
关键词
Networked data; networks; co-clustering; topology; nonnegative matrix factorization; ALGORITHMS;
D O I
10.1109/TKDE.2018.2846555
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Networked data are common in domains where instances are characterized by both feature values and inter-dependency relationships. Finding cluster structures for networked instances and discovering representative features for each cluster represent a special co-clustering task usefully for many real-world applications, such as automatic categorization of scientific publications and finding representative key-words for each cluster. To date, although co-clustering has been commonly used for finding clusters for both instances and features, all existing methods are focused on instance-feature values, without leveraging valuable topology relationships between instances to help boost co-clustering performance. In this paper, we propose CFOND, a consensus factorization based framework for co-clustering networked data. We argue that feature values and linkages provide useful information from different perspectives, but they are not always consistent and therefore need to be carefully aligned for best clustering results. In the paper, we advocate a consensus factorization principle, which simultaneously factorizes information from three aspects: network topology structures, instance-feature content relationships, and feature-feature correlations. The consensus factorization ensures that the final cluster structures are consistent across information from the three aspects with minimum errors. Experiments on real-life networks validate the performance of our algorithm.
引用
收藏
页码:706 / 719
页数:14
相关论文
共 50 条
  • [41] Subspace Weighting Co-Clustering of Gene Expression Data
    Chen, Xiaojun
    Huang, Joshua Z.
    Wu, Qingyao
    Yang, Min
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (02) : 352 - 364
  • [42] FAST CLUSTERING WITH CO-CLUSTERING VIA DISCRETE NON-NEGATIVE MATRIX FACTORIZATION FOR IMAGE IDENTIFICATION
    Nie, Feiping
    Pei, Shenfei
    Wang, Rong
    Li, Xuelong
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2073 - 2077
  • [43] Multitask possibilistic and fuzzy co-clustering algorithm for clustering data with multisource features
    Jiaqi Ren
    Youlong Yang
    Neural Computing and Applications, 2020, 32 : 4785 - 4804
  • [44] Multitask fuzzy Bregman co-clustering approach for clustering data with multisource features
    Sokhandan, Alireza
    Adibi, Peyman
    Sajadi, Mohammadreza
    NEUROCOMPUTING, 2017, 247 : 102 - 114
  • [45] Multitask possibilistic and fuzzy co-clustering algorithm for clustering data with multisource features
    Ren, Jiaqi
    Yang, Youlong
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (09): : 4785 - 4804
  • [46] Ensemble Block Co-clustering: A Unified Framework for Text Data
    Affeldt, Severine
    Labiod, Lazhar
    Nadif, Mohamed
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 5 - 14
  • [47] Hierarchical Co-Clustering: A New Way to Organize the Music Data
    Li, Jingxuan
    Shao, Bo
    Li, Tao
    Ogihara, Mitsunori
    IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (02) : 471 - 481
  • [48] Co-clustering based classification of multi-view data
    Syed Fawad Hussain
    Mohsin Khan
    Imran Siddiqi
    Applied Intelligence, 2022, 52 : 14756 - 14772
  • [49] A Framework for Simultaneous Co-clustering and Learning from Complex Data
    Deodhar, Meghana
    Ghosh, Joydeep
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 250 - 259
  • [50] Spectral co-clustering ensemble
    Huang, Shudong
    Wang, Hongjun
    Li, Dingcheng
    Yang, Yan
    Li, Tianrui
    KNOWLEDGE-BASED SYSTEMS, 2015, 84 : 46 - 55