Accuracy estimate and optimization techniques for SimRank computation

被引:67
|
作者
Lizorkin, Dmitry [1 ]
Velikhov, Pavel [1 ]
Grinev, Maxim [1 ]
Turdakov, Denis [1 ]
机构
[1] Russian Acad Sci, Inst Syst Programming, Moscow 109004, Russia
来源
VLDB JOURNAL | 2010年 / 19卷 / 01期
关键词
Similarity measure; Graph theory; SimRank; Algorithm; Computational complexity;
D O I
10.1007/s00778-009-0168-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The measure of similarity between objects is a very useful tool in many areas of computer science, including information retrieval. SimRank is a simple and intuitive measure of this kind, based on a graph-theoretic model. SimRank is typically computed iteratively, in the spirit of PageRank. However, existing work on SimRank lacks accuracy estimation of iterative computation and has discouraging time complexity. In this paper, we present a technique to estimate the accuracy of computing SimRank iteratively. This technique provides a way to find out the number of iterations required to achieve a desired accuracy when computing SimRank. We also present optimization techniques that improve the computational complexity of the iterative algorithm from O(n(4)) in the worst case to min(O(nl), O(n(3)/log(2)n)), with n denoting the number of objects, and l denoting the number object-to-object relationships. We also introduce a threshold sieving heuristic and its accuracy estimation that further improves the efficiency of the method. As a practical illustration of our techniques, we computed SimRank scores on a subset of English Wikipedia corpus, consisting of the complete set of articles and category links.
引用
收藏
页码:45 / 66
页数:22
相关论文
共 50 条
  • [41] MINIMAX ESTIMATE ACCURACY
    PSHENICHNY, BN
    POKOTILO, VG
    DOPOVIDI AKADEMII NAUK UKRAINSKOI RSR SERIYA A-FIZIKO-MATEMATICHNI TA TECHNICHNI NAUKI, 1982, (03): : 61 - 63
  • [42] AN ESTIMATE OF ACCURACY IN IDENTIFICATION
    MASLOV, EP
    AUTOMATION AND REMOTE CONTROL, 1966, 27 (10) : 1707 - &
  • [43] Accuracy Optimization in Speech Pathology Diagnosis with Data Preprocessing Techniques
    Teixeira Fernandes, Joana Filipa
    Freitas, Diamantino Rui
    Teixeira, Joao Paulo
    OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, PT I, OL2A 2023, 2024, 1981 : 287 - 299
  • [44] Joint Computation Offloading and Sampling Interval Optimization for Accuracy-Guaranteed Surveillance
    Nishio, Takayuki
    Inoue, Yoshiaki
    Nakayama, Yu
    Katsurai, Marie
    2021 IEEE 18TH ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2021,
  • [45] Comparative study of accuracy and computation time for optimal network reconfiguration techniques via simulation
    Deese, Anthony S.
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2014, 63 : 394 - 400
  • [46] Performance comparison of feature reduction techniques in-terms of compactness, computation time and accuracy
    Vijai, Praveen
    Sivakumar, Bagavathi P.
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 374 - 380
  • [47] Intelligent computation techniques for optimization of the shortest path in an asynchronous network-on-chip
    Ilamathi, K.
    Rangarajan, P.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 1): : 335 - 346
  • [48] Intelligent computation techniques for optimization of the shortest path in an asynchronous network-on-chip
    K. Ilamathi
    P. Rangarajan
    Cluster Computing, 2019, 22 : 335 - 346
  • [49] Fuzzy Logic for Improving Interactive Evolutionary Computation Techniques for Ad Text Optimization
    Madera, Quetzali
    Garcia, Mario
    Castillo, Oscar
    NOVEL DEVELOPMENTS IN UNCERTAINTY REPRESENTATION AND PROCESSING: ADVANCES IN INTUITIONISTIC FUZZY SETS AND GENERALIZED NETS, 2016, 401 : 291 - 300
  • [50] Multicast tree computation for group communication in mobile networks using optimization techniques
    Gopalan, N. P.
    Mala, C.
    Shriram, R.
    Agarwal, Shashank
    2006 INTERNATIONAL SYMPOSIUM ON AD HOC AND UBIQUITOUS COMPUTING, 2007, : 84 - 89