Fast computation of General SimRank on heterogeneous information network

Times cited: 0
Authors
Zhang, Chuanyan [1 ]
Hong, Xiaoguang [2 ,3 ]
Zheng, Yongqing [2 ,3 ]
Affiliations
[1] Qilu Normal Univ, Coll Informat Sci & Engn, Jinan 250200, Shandong, Peoples R China
[2] Shandong Univ, Software Sch, Jinan 250101, Shandong, Peoples R China
[3] Dareway Software Co Ltd, Jinan 250000, Shandong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Similarity; General SimRank; Fast computation; Linear system; Heterogeneous information network; SIMILARITY MEASURE; EFFICIENT;
DOI
10.1007/s10791-024-09438-5
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
Similarity computation is a fundamental aspect of information network analysis, underpinning many research tasks including information retrieval, clustering, and recommendation systems. General SimRank (GSR), an extension of the well-known SimRank algorithm, effectively computes link-based global similarities incorporating semantic logic within heterogeneous information networks (HINs). However, GSR inherits the recursive nature of SimRank, making it computationally expensive to achieve convergence through iterative processes. While numerous rapid computation methods exist for SimRank, their direct application to GSR is impeded by differences in their underlying equations. To accelerate GSR computation, we introduce a novel approach based on linear systems. Specifically, we transform the pairwise surfer model of GSR on HINs into a new random walk model on a node-pair graph, establishing an equivalent linear system for GSR. We then develop a fast algorithm utilizing the local push technique to compute all-pair GSR scores with guaranteed accuracy. Additionally, we adapt the local push method for dynamic HINs and introduce a corresponding incremental algorithm. Experimental results on various real datasets demonstrate that our algorithms significantly outperform the traditional power method in both static and dynamic HIN contexts.
Pages: 20
Related papers
50 in total
  • [1] A Fast Heterogeneous Information Network Embedding Framework
    Wang, Kuangmeng
    Zhang, Hong
    Proceedings of the International Joint Conference on Neural Networks, 2024,
  • [2] Fast and Accurate SimRank Computation via Forward Local Push and its Parallelization
    Wang, Yue
    Che, Yulin
    Lian, Xiang
    Chen, Lei
    Luo, Qiong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (12) : 3686 - 3700
  • [3] SimRank Computation on Uncertain Graphs
    Zhu, Rong
    Zou, Zhaonian
    Li, Jianzhong
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 565 - 576
  • [4] Privacy-preserving SimRank over Distributed Information Network
    Chu, Yu-Wei
    Tai, Chih-Hua
    Chen, Ming-Syan
    Yu, Philip S.
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 840 - 845
  • [5] A Novel and Fast SimRank Algorithm
    Lu, Juan
    Gong, Zhiguo
    Lin, Xuemin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (03) : 572 - 585
  • [6] Accuracy estimate and optimization techniques for SimRank computation
    Dmitry Lizorkin
    Pavel Velikhov
    Maxim Grinev
    Denis Turdakov
    The VLDB Journal, 2010, 19 : 45 - 66
  • [7] Heterogeneous Information Network Hashing for Fast Nearest Neighbor Search
    Peng, Zhen
    Luo, Minnan
    Li, Jundong
    Chen, Chen
    Zheng, Qinghua
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 : 571 - 586
  • [8] JAMI: fast computation of conditional mutual information for ceRNA network analysis
    Hornakova, Andrea
    List, Markus
    Vreeken, Jilles
    Schulz, Marcel H.
    BIOINFORMATICS, 2018, 34 (17) : 3050 - 3051
  • [9] Accuracy estimate and optimization techniques for SimRank computation
    Lizorkin, Dmitry
    Velikhov, Pavel
    Grinev, Maxim
    Turdakov, Denis
    VLDB JOURNAL, 2010, 19 (01): : 45 - 66
  • [10] Probabilistic SimRank computation over uncertain graphs
    Du, Lingxia
    Li, Cuiping
    Chen, Hong
    Tan, Liwen
    Zhang, Yinglong
    INFORMATION SCIENCES, 2015, 295 : 521 - 535