An effective representation learning model for link prediction in heterogeneous information networks

Cited by: 0
Authors
Kumar, Vishnu [1]
Krishna, P. Radha [1]
Affiliations
[1] Natl Inst Technol, Dept CSE, Warangal 506004, Telangana, India
Keywords
Link prediction; Node classification; Metapath; Attention mechanism; Semantic confusion; Feature representation learning; Networks embedding; Influence propagation
DOI
10.1007/s00607-023-01238-x
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Heterogeneous Information Networks (HINs) consist of multiple categories of nodes and edges and encompass rich semantic information. Representing HINs in a low-dimensional feature space is challenging because of their complex structure and rich semantics. In this paper, we focus on link prediction and node classification by learning efficient low-dimensional feature representations of HINs. Metapath-guided walkers have been extensively studied in the literature for learning feature representations. However, the metapath walker does not control the length of random walks, so the resulting embeddings capture structural and semantic information only weakly. In this work, we present an influence-propagation-controlled metapath-guided random walk model (called IPCMetapath2Vec) for representation learning in HINs. The model works in three phases. First, we perform node transitions to generate a metapath-guided random walk, conditioned on two factors: (i) the type mapping of the next node according to the metapath, and (ii) an influence propagation score computed for each node, with a threshold-based filter that detects potential influencers on the walk. Next, we provide the collected random walks as input to the skip-gram model to learn each node's feature representation. Lastly, we employ an attention mechanism that aggregates the learned feature representations of each node from the various semantic metapath-guided walks, preserving the importance of the different semantics. We use these network representation features to address link prediction and multi-label node classification tasks. Experimental results on two public HIN datasets, DBLP and IMDB, show that our model outperforms state-of-the-art representation learning models such as DeepWalk, Node2vec, Metapath2Vec, and HIN2Vec by 4.5% to 17.2% in micro-F1 score for multi-label node classification and by 4% to 14.50% in AUC-ROC score for link prediction.
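The first and third phases described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define the influence propagation score, how the threshold filter acts on the walk, or how the attention weights are learned. The sketch therefore assumes (a) normalized node degree as a placeholder score, (b) that candidates scoring below the threshold are dropped and the walk stops early when none remain, and (c) that per-metapath attention scores are supplied by the caller. All function names are hypothetical.

```python
import math
import random
from collections import defaultdict


def build_graph(edges):
    """Build an undirected adjacency map from (u, v) pairs."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj


def influence_score(adj, node):
    # ASSUMPTION: degree normalized by the maximum degree. This is only a
    # placeholder; the abstract does not define the actual score.
    max_degree = max(len(neighbors) for neighbors in adj.values())
    return len(adj[node]) / max_degree


def metapath_walk(adj, node_type, start, metapath, max_len, threshold, rng):
    """Generate one metapath-guided walk with a threshold filter.

    metapath is a list of node types whose first and last entries match,
    e.g. ["A", "P", "A"], so that the pattern can be repeated.
    """
    assert node_type[start] == metapath[0]
    cycle = metapath[:-1]  # repeating unit of the type pattern
    walk = [start]
    step = 0
    while len(walk) < max_len:
        step += 1
        wanted = cycle[step % len(cycle)]  # condition (i): next node type
        candidates = sorted(
            n for n in adj[walk[-1]]
            if node_type[n] == wanted
            and influence_score(adj, n) >= threshold  # condition (ii), assumed form
        )
        if not candidates:
            break  # ASSUMPTION: the filter ends the walk early
        walk.append(rng.choice(candidates))
    return walk


def attention_aggregate(embeddings, scores):
    """Softmax-weighted sum of one node's per-metapath embeddings.

    ASSUMPTION: scores are given; a real model would learn them.
    """
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]  # numerically stable softmax
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(embeddings[0])
    return [
        sum(w * emb[i] for w, emb in zip(weights, embeddings))
        for i in range(dim)
    ]
```

For the second phase, the collected walks (lists of node identifiers) could be passed as "sentences" to any skip-gram implementation, for example gensim's Word2Vec with sg=1; that choice is likewise an assumption, not something the abstract specifies.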
Pages: 2185-2210
Page count: 26
Related papers
50 items in total
  • [21] An Efficient Link Prediction Model in Dynamic Heterogeneous Information Networks Based on Multiple Self-attention
    Ruan, Beibei
    Zhu, Cui
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, 2021, 12817 : 62 - 74
  • [22] Link Prediction in Heterogeneous Social Networks
    Negi, Sumit
    Chaudhury, Santanu
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 609 - 617
  • [23] Link Prediction in Aligned Heterogeneous Networks
    Liu, Fangbing
    Xia, Shu-Tao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART I, 2015, 9077 : 33 - 44
  • [24] Link Prediction in Temporal Heterogeneous Networks
    Lakshmi, T. Jaya
    Bhavani, S. Durga
    INTELLIGENCE AND SECURITY INFORMATICS (PAISI 2017), 2017, 10241 : 83 - 98
  • [25] Link Prediction by Utilizing Correlations Between Link Types and Path Types in Heterogeneous Information Networks
    Jeong, Hyun Ji
    Taeyeon, Kim
    Kim, Myoung Ho
    DATA MINING AND BIG DATA, DMBD 2016, 2016, 9714 : 156 - 164
  • [26] Rating prediction model based on heterogeneous network representation learning
    Zhan N.
    Liu W.
    Chen X.
    Pu J.
    Liu, Wei (wayne@buaa.edu.cn), 1600, Beijing University of Aeronautics and Astronautics (BUAA) (47): : 1077 - 1084
  • [27] Representation Learning in Heterogeneous Information Networks Based on Hyper Adjacency Matrix
    Yang, Bin
    Wang, Yitong
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT I, 2022, : 747 - 755
  • [28] TransPath: Representation Learning for Heterogeneous Information Networks via Translation Mechanism
    Fang, Yang
    Zhao, Xiang
    Tan, Zhen
    Xiao, Weidong
    IEEE ACCESS, 2018, 6 : 20712 - 20721
  • [29] RL4HIN: Representation Learning for Heterogeneous Information Networks
    Liu, Chunfeng
    Liu, Ying
    Yu, Mei
    Yu, Ruiguo
    Li, Xuewei
    Zhao, Mankun
    Xu, Tianyi
    Liu, Hongwei
    Xu, Linying
    Yu, Jian
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [30] Scalable Representation Learning for Dynamic Heterogeneous Information Networks via Metagraphs
    Fang, Yang
    Zhao, Xiang
    Huang, Peixin
    Xiao, Weidong
    de Rijke, Maarten
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2022, 40 (04)