Accuracy estimate and optimization techniques for SimRank computation

Cited by: 67
Authors
Lizorkin, Dmitry [1]
Velikhov, Pavel [1]
Grinev, Maxim [1]
Turdakov, Denis [1]
Affiliations
[1] Russian Acad Sci, Inst Syst Programming, Moscow 109004, Russia
Source
VLDB JOURNAL | 2010 / Vol. 19 / Issue 01
Keywords
Similarity measure; Graph theory; SimRank; Algorithm; Computational complexity;
DOI
10.1007/s00778-009-0168-8
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
The measure of similarity between objects is a very useful tool in many areas of computer science, including information retrieval. SimRank is a simple and intuitive measure of this kind, based on a graph-theoretic model. SimRank is typically computed iteratively, in the spirit of PageRank. However, existing work on SimRank lacks accuracy estimation of iterative computation and has discouraging time complexity. In this paper, we present a technique to estimate the accuracy of computing SimRank iteratively. This technique provides a way to find out the number of iterations required to achieve a desired accuracy when computing SimRank. We also present optimization techniques that improve the computational complexity of the iterative algorithm from $O(n^4)$ in the worst case to $\min(O(nl), O(n^3/\log_2 n))$, with $n$ denoting the number of objects, and $l$ denoting the number of object-to-object relationships. We also introduce a threshold sieving heuristic and its accuracy estimation that further improves the efficiency of the method. As a practical illustration of our techniques, we computed SimRank scores on a subset of English Wikipedia corpus, consisting of the complete set of articles and category links.
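
To make the iterative scheme in the abstract concrete, the following is a minimal sketch of the naive, unoptimized SimRank iteration, not the optimized method of the paper. The iteration count is chosen from an error bound of the form $C^{k+1}$ after $k$ iterations; the abstract does not state this bound, so it is an assumption here. The function names `iterations_for_accuracy` and `simrank`, the decay factor `c`, and the `in_neighbors` adjacency format are likewise illustrative.

```python
def iterations_for_accuracy(c, eps):
    """Smallest k such that c**(k+1) <= eps.

    Assumption: the error after k iterations is bounded by c**(k+1).
    This bound is not stated in the abstract.
    """
    k = 0
    while c ** (k + 1) > eps:
        k += 1
    return k


def simrank(in_neighbors, c=0.8, eps=1e-3):
    """Naive iterative SimRank over a dict mapping node -> list of in-neighbors.

    Returns (scores, k) where scores[a][b] is the similarity estimate
    and k is the number of iterations performed.
    """
    nodes = list(in_neighbors)
    k_max = iterations_for_accuracy(c, eps)
    # Iteration 0: every node is similar only to itself.
    scores = {a: {b: 1.0 if a == b else 0.0 for b in nodes} for a in nodes}
    for _ in range(k_max):
        new_scores = {}
        for a in nodes:
            new_scores[a] = {}
            for b in nodes:
                if a == b:
                    new_scores[a][b] = 1.0
                    continue
                in_a, in_b = in_neighbors[a], in_neighbors[b]
                if not in_a or not in_b:
                    # A node without in-neighbors has similarity 0 to others.
                    new_scores[a][b] = 0.0
                    continue
                # Average the previous scores over all in-neighbor pairs.
                total = sum(scores[i][j] for i in in_a for j in in_b)
                new_scores[a][b] = c * total / (len(in_a) * len(in_b))
        scores = new_scores
    return scores, k_max


if __name__ == "__main__":
    # Hypothetical toy graph: node "u" links to both "a" and "b".
    graph = {"u": [], "a": ["u"], "b": ["u"]}
    scores, k = simrank(graph, c=0.8, eps=1e-3)
    print(k, scores["a"]["b"])
```

The nested sums over in-neighbor pairs are what make this baseline expensive on dense graphs, which is the cost the optimizations described in the abstract are aimed at.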
Pages: 45 - 66
Page count: 22
Related papers
50 in total
  • [31] Deep Learning Techniques for Accuracy Optimization in Wireless Networks
    AL-Twalah, Sahar Suliman
    AL-Ammar, Fadhilah Mousa
    Eljack, Sarah M.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (03) : 161 - 167
  • [32] Computation of bending behavior of woven structures using optimization techniques
    Sagar, TV
    Potluri, P
    TEXTILE RESEARCH JOURNAL, 2004, 74 (10) : 879 - 886
  • [33] Evaluation of the improvement in the position estimate accuracy of UMTS mobiles with hybrid positioning techniques
    Pagès-Zamora, A
    Vidal, J
    IEEE 55TH VEHICULAR TECHNOLOGY CONFERENCE, VTC SPRING 2002, VOLS 1-4, PROCEEDINGS, 2002, : 1631 - 1635
  • [34] Hybrid method for global optimization using more accuracy interval computation
    崔中浩
    雷咏梅
    Advances in Manufacturing, 2011, (05) : 445 - 450
  • [35] Hybrid method for global optimization using more accuracy interval computation
    崔中浩
    雷咏梅
    Journal of Shanghai University(English Edition), 2011, 15 (05) : 445 - 450
  • [36] HitSim: An Efficient Algorithm for Single-Source and Top-k SimRank Computation
    Bai, Jing
    Zhou, Junfeng
    Chen, Shuotong
    Du, Ming
    Chen, Ziyang
    Min, Mengtao
    INFORMATION, 2024, 15 (06)
  • [37] SimSky: An Accuracy-Aware Algorithm for Single-Source SimRank Search
    Yan, Liping
    Yu, Weiren
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT III, 2023, 14171 : 226 - 241
  • [38] UniWalk: Unidirectional Random Walk Based Scalable SimRank Computation over Large Graph
    Luo, XiongCai
    Gao, Jun
    Zhou, Chang
    Yu, Jeffrey Xu
    2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 325 - 336
  • [39] On Optimization Techniques for the Construction of an Exponential Estimate for Delayed Recurrent Neural Networks
    Martsenyuk, Vasyl
    Rajba, Stanislaw
    Karpinski, Mikolaj
    SYMMETRY-BASEL, 2020, 12 (10) : 1 - 11
  • [40] UniWalk: Unidirectional Random Walk Based Scalable SimRank Computation over Large Graph
    Song, Junshuai
    Luo, Xiongcai
    Gao, Jun
    Zhou, Chang
    Wei, Hu
    Yu, Jeffery Xu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (05) : 992 - 1006