Top-k similarity search in heterogeneous information networks with x-star network schema

被引:34
|
作者
Zhang, Mingxi [1 ,2 ]
Hu, Hao [2 ]
He, Zhenying [2 ]
Wang, Wei [2 ]
机构
[1] Univ Shanghai Sci & Technol, Coll Commun & Art Design, Shanghai 200093, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai 201203, Peoples R China
基金
美国国家科学基金会;
关键词
Similarity search; Information network; x-star network schema;
D O I
10.1016/j.eswa.2014.08.039
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An x-star network is an information network which consists of centers with connections among themselves, and different type attributes linking to these centers. As x-star networks become ubiquitous, extracting knowledge from x-star networks has become an important task. Similarity search in x-star network aims to find the centers similar to a given query center, which has numerous applications including collaborative filtering, community mining and web search. Although existing methods yield promising similar results, such as SimRank and P-Rank, they are not applicable for massive x-star networks. In this paper, we propose a structural-based similarity measure, NetSim, towards efficiently computing similarity between centers in an x-star network. The similarity between attributes is computed in the pre-processing stage by the expected meeting probability over attribute network that is extracted from the whole structure of x-star network. The similarity between centers is computed online according to the attribute similarities based on the intuition that similar centers are linked with similar attributes. NetSim requires less time and space cost than existing methods since the scale of attribute network is significantly smaller than the whole x-star network. For supporting fast online query processing, we develop a pruning algorithm by building a pruning index, which prunes candidate centers that are not promising. Extensive experiments demonstrate the effectiveness and efficiency of our method through comparing with the state-of-the-art measures. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:699 / 712
页数:14
相关论文
共 50 条
  • [21] Semantic enhanced Top-k similarity search on weighted HIN
    Yun Zhang
    Minghe Yu
    Tiancheng Zhang
    Ge Yu
    Neural Computing and Applications, 2022, 34 : 16911 - 16927
  • [22] Semantic enhanced Top-k similarity search on weighted HIN
    Zhang, Yun
    Yu, Minghe
    Zhang, Tiancheng
    Yu, Ge
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (19): : 16911 - 16927
  • [23] Integrating Meta-Path Selection with User-Preference for Top-k Relevant Search in Heterogeneous Information Networks
    Bu, Shaoli
    Hong, Xiaoguang
    Peng, Zhaohui
    Li, Qingzhong
    PROCEEDINGS OF THE 2014 IEEE 18TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2014, : 301 - 306
  • [24] Top-K structural diversity search in large networks
    Xin Huang
    Hong Cheng
    Rong-Hua Li
    Lu Qin
    Jeffrey Xu Yu
    The VLDB Journal, 2015, 24 : 319 - 343
  • [25] Top-K Structural Diversity Search in Large Networks
    Xin Huang
    Hong Cheng
    Li, Rong-Hua
    Lu Qin
    Yu, Jeffrey Xu
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (13): : 1618 - 1629
  • [26] Top-K structural diversity search in large networks
    Huang, Xin
    Cheng, Hong
    Li, Rong-Hua
    Qin, Lu
    Yu, Jeffrey Xu
    VLDB JOURNAL, 2015, 24 (03): : 319 - 343
  • [27] Approximate top-k structural similarity search over XML documents
    Xie, T
    Sha, CF
    Wang, XL
    Zhou, AY
    FRONTIERS OF WWW RESEARCH AND DEVELOPMENT - APWEB 2006, PROCEEDINGS, 2006, 3841 : 319 - 330
  • [28] Subspace Similarity Search Using the Ideas of Ranking and Top-k Retrieval
    Bernecker, Thomas
    Emrich, Tobias
    Graf, Franz
    Kriegel, Hans-Peter
    Kroeger, Peer
    Renz, Matthias
    Schubert, Erich
    Zimek, Arthur
    2010 IEEE 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDE 2010), 2010, : 4 - 9
  • [29] Fast Top-K Graph Similarity Search Via Representative Matrices
    Sun, Zhigang
    Huo, Hongwei
    Chen, Xiaoyang
    IEEE ACCESS, 2018, 6 : 21408 - 21417
  • [30] Top-k String Similarity Search with Edit-Distance Constraints
    Deng, Dong
    Li, Guoliang
    Feng, Jianhua
    Li, Wen-Syan
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 925 - 936