Semi-supervised Co-Clustering on Attributed Heterogeneous Information Networks

被引:9
|
作者
Ji, Yugang [1 ]
Shi, Chuan [1 ]
Fang, Yuan [2 ]
Kong, Xiangnan [3 ]
Yin, Mingyang [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Singapore Management Univ, Singapore, Singapore
[3] Worcester Polytech Inst, Worcester, MA 01609 USA
[4] Alibaba Grp, Hangzhou, Peoples R China
基金
中国国家自然科学基金; 新加坡国家研究基金会;
关键词
Co-clustering; Heterogeneous information network; Meta-paths; Matrix tri-factorization; Semi-supervised learning; MATRIX FACTORIZATION;
D O I
10.1016/j.ipm.2020.102338
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Node clustering on heterogeneous information networks (HINs) plays an important role in many real-world applications. While previous research mainly clusters same-type nodes independently via exploiting structural similarity search, they ignore the correlations of different-type nodes. In this paper, we focus on the problem of co-clustering heterogeneous nodes where the goal is to mine the latent relevance of heterogeneous nodes and simultaneously partition them into the corresponding type-aware clusters. This problem is challenging in two aspects. First, the similarity or relevance of nodes is not only associated with multiple meta-path-based structures but also related to numerical and categorical attributes. Second, clusters and similarity/relevance searches usually promote each other. To address this problem, we first design a learnable overall relevance measure that integrates the structural and attributed relevance by employing meta-paths and attribute projection. We then propose a novel approach, called SCCAIN, to co-cluster heterogeneous nodes based on constrained orthogonal non-negative matrix tri-factorization. Furthermore, an end-to-end framework is developed to jointly optimize the relevance measures and co-clustering. Extensive experiments on real-world datasets not only demonstrate that SCCAIN consistently outperforms state-of-the-art methods but also validate the effectiveness of integrating attributed and structural information for co-clustering.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Semi-supervised fuzzy co-clustering for hospital-cost analysis from electronic medical records
    Duong Thi Thu Huyen
    Tran Manh Tuan
    Le Hoang Son
    Drogoul, Alexis
    2016 EIGHTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2016, : 25 - 30
  • [22] AN MBO SCHEME FOR CLUSTERING AND SEMI-SUPERVISED CLUSTERING OF SIGNED NETWORKS
    Cucuringu, Mihai
    Pizzoferrato, Andrea
    Van Gennip, Yves
    COMMUNICATIONS IN MATHEMATICAL SCIENCES, 2021, 19 (01) : 73 - 109
  • [23] Customer Clustering Using Semi-supervised Geographic Information
    Lin, Zhonglin
    Chen, Gang
    Bai, Xinxin
    Lv, Hairong
    Yin, Wenjun
    Dong, Jin
    PROCEEDINGS OF 2009 IEEE INTERNATIONAL CONFERENCE ON SERVICE OPERATION, LOGISTICS AND INFORMATICS, 2009, : 465 - +
  • [24] Semi-supervised Clustering using Similarity Neural Networks
    Melacci, Stefano
    Maggini, Marco
    Sarti, Lorenzo
    IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 615 - 622
  • [25] Semi-supervised learning in unbalanced networks with heterogeneous degree
    Li, Ting
    Ying, Ningchen
    Yu, Xianshi
    Jing, Bing-Yi
    STATISTICS AND ITS INTERFACE, 2024, 17 (03) : 501 - 516
  • [26] Semi-supervised clustering methods
    Bair, Eric
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2013, 5 (05): : 349 - 361
  • [27] SEMI-SUPERVISED SPECTRAL CLUSTERING
    Mai, Xiaoyi
    Couillet, Romain
    2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 2012 - 2016
  • [28] A review on semi-supervised clustering
    Cai, Jianghui
    Hao, Jing
    Yang, Haifeng
    Zhao, Xujun
    Yang, Yuqing
    INFORMATION SCIENCES, 2023, 632 : 164 - 200
  • [29] Semi-supervised Selective Clustering Ensemble based on constraint information
    Ma, Tinghuai
    Zhang, Zheng
    Guo, Lei
    Wang, Xin
    Qian, Yurong
    Al-Nabhan, Najla
    NEUROCOMPUTING, 2021, 462 : 412 - 425
  • [30] Semi-supervised image clustering with multi-modal information
    Jianqing Liang
    Yahong Han
    Qinghua Hu
    Multimedia Systems, 2016, 22 : 149 - 160