Similarity enhancement of heterogeneous networks by weighted incorporation of information

被引:0
|
作者
Fatemeh Baharifard
Vahid Motaghed
机构
[1] Institute for Research in Fundamental Sciences (IPM),School of Computer Science
来源
关键词
Heterogeneous Graph; Unsupervised learning; Natural language processing; Node clustering;
D O I
暂无
中图分类号
学科分类号
摘要
In many real-world datasets, different aspects of information are combined, so the data is usually represented as heterogeneous graphs whose nodes and edges have different types. Learning representations in heterogeneous networks is one of the most important topics that can be utilized to extract important details from the networks with the embedding methods. In this paper, we introduce a new framework for embedding heterogeneous graphs. Our model relies on weighted heterogeneous networks with star structures that take structural and attributive similarity into account as well as semantic knowledge. The target nodes form the center of the star and the different attributes of the target nodes form the points of the star. The edge weights are calculated based on three aspects, including the natural language processing in texts, the relationship between different attributes of the dataset and the co-occurrence of each attribute pair in target nodes. We strengthen the similarities between the target nodes by examining the latent connections between the attribute nodes. We find these indirect connections by considering the approximate shortest path between the attributes. By applying the side effect of the star components to the central component, the heterogeneous network is reduced to a homogeneous graph with enhanced similarities. Thus, we can embed this homogeneous graph to capture the similar target nodes. We evaluate our framework for the clustering task and show that our method is more accurate than previous unsupervised algorithms for real-world datasets.
引用
收藏
页码:3133 / 3156
页数:23
相关论文
共 50 条
  • [1] Similarity enhancement of heterogeneous networks by weighted incorporation of information
    Baharifard, Fatemeh
    Motaghed, Vahid
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (05) : 3133 - 3156
  • [2] A Weighted Similarity Measure Based on Meta Structure in Heterogeneous Information Networks
    Li, Zhaochen
    Wang, Hengliang
    KNOWLEDGE MANAGEMENT AND ACQUISITION FOR INTELLIGENT SYSTEMS (PKAW 2018), 2018, 11016 : 271 - 281
  • [3] A Semantic Path-Based Similarity Measure for Weighted Heterogeneous Information Networks
    Yang, Chunxue
    Zhao, Chenfei
    Wang, Hengliang
    Qiu, Riming
    Li, Yuan
    Mu, Kedian
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2018), PT I, 2018, 11061 : 311 - 323
  • [4] Exploiting Transitive Similarity and Temporal Dynamics for Similarity Search in Heterogeneous Information Networks
    He, Jiazhen
    Bailey, James
    Zhang, Rui
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, PT II, 2014, 8422 : 141 - 155
  • [5] Top-k Similarity Join in Heterogeneous Information Networks
    Xiong, Yun
    Zhu, Yangyong
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (06) : 1710 - 1723
  • [6] KnowSim: A document similarity measure on structured heterogeneous information networks
    School of EECS, Peking University, China
    不详
    Proc. IEEE Int. Conf. Data Min. ICDM, 1600, (1015-1020):
  • [7] A semantic-rich similarity measure in heterogeneous information networks
    Zhou, Yu
    Huang, Jianbin
    Li, He
    Sun, Heli
    Peng, Yan
    Xu, Yueshen
    KNOWLEDGE-BASED SYSTEMS, 2018, 154 : 32 - 42
  • [8] Edit Distance Based Similarity Search of Heterogeneous Information Networks
    Lu, Jianhua
    Lu, Ningyun
    Ma, Sipei
    Zhang, Baili
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 195 - 202
  • [9] Neural PathSim for Inductive Similarity Search in Heterogeneous Information Networks
    Xiao, Wenyi
    Zhao, Huan
    Zheng, Vincent W.
    Song, Yangqiu
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2201 - 2210
  • [10] HowSim: A General and Effective Similarity Measure on Heterogeneous Information Networks
    Wang, Yue
    Wang, Zhe
    Zhao, Ziyuan
    Li, Zijian
    Jian, Xun
    Chen, Lei
    Song, Jianchun
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1954 - 1957