Efficient Correlation Search from Graph Databases

Cited by: 12
Authors
Ke, Yiping [1]
Cheng, James [1]
Ng, Wilfred [1]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
Keywords
Correlation; graph databases; Pearson's correlation coefficient;
DOI
10.1109/TKDE.2008.86
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a new problem of correlation mining from graph databases, called Correlated Graph Search (CGS). CGS adopts Pearson's correlation coefficient as the correlation measure to take into account the occurrence distributions of graphs. However, the CGS problem poses significant challenges, since every subgraph of a graph in the database is a candidate, but the number of subgraphs is exponential. We derive two necessary conditions that set bounds on the occurrence probability of a candidate in the database. With this result, we devise an efficient algorithm that mines the candidate set from a much smaller projected database, and thus, we are able to obtain a significantly smaller set of candidates. Three heuristic rules are further developed to refine the candidate set. We also make use of the bounds to directly answer high-support queries without mining the candidates. Our experimental results demonstrate the efficiency of our algorithm. Finally, we show that our algorithm provides a general solution when most of the commonly used correlation measures are used to generalize the CGS problem.
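To illustrate the correlation measure named in the abstract: when each graph is treated as a binary variable (it either occurs in a database graph or it does not), Pearson's correlation coefficient reduces to the standard phi coefficient over occurrence probabilities. The sketch below is not the paper's algorithm; it only evaluates the measure for two graphs whose occurrence sets are already known. The function name, the representation of occurrences as sets of database graph ids, and the choice to return 0.0 when the coefficient is undefined are all assumptions made for this example.

```python
from math import sqrt


def occurrence_correlation(occ_a, occ_b, db_size):
    """Pearson's correlation of two binary occurrence variables (phi coefficient).

    occ_a, occ_b: sets of ids of the database graphs that contain each pattern
                  (hypothetical representation chosen for this sketch).
    db_size:      total number of graphs in the database.
    """
    if db_size <= 0:
        raise ValueError("db_size must be positive")

    p_a = len(occ_a) / db_size            # occurrence probability of pattern A
    p_b = len(occ_b) / db_size            # occurrence probability of pattern B
    p_ab = len(occ_a & occ_b) / db_size   # joint occurrence probability

    denom = sqrt(p_a * p_b * (1 - p_a) * (1 - p_b))
    if denom == 0:
        # Undefined when a pattern occurs in no graph or in every graph;
        # returning 0.0 is a convention assumed here, not taken from the paper.
        return 0.0
    return (p_ab - p_a * p_b) / denom


# Toy database of 10 graphs: both patterns occur in 4 graphs, 3 of them shared.
query_occ = {0, 1, 2, 3}
candidate_occ = {0, 1, 2, 4}
print(occurrence_correlation(query_occ, candidate_occ, 10))
```

Computing this value is cheap once the occurrence sets are known; the difficulty the abstract describes lies in the exponential number of candidate subgraphs for which it would otherwise have to be evaluated.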
Pages: 1601-1615
Page count: 15
Related papers
50 in total
  • [31] Authenticated Subgraph Similarity Search in Outsourced Graph Databases
    Peng, Yun
    Fan, Zhe
    Choi, Byron
    Xu, Jianliang
    Bhowmick, Sourav S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (07) : 1838 - 1860
  • [32] From Databases to Graph Visualization
    Gilbert, Frederic
    Auber, David
    2010 14TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV 2010), 2010 : 128 - 133
  • [33] PICASSO: Exploratory Search of Connected Subgraph Substructures in Graph Databases
    Huang, Kai
    Bhowmick, Sourav S.
    Zhou, Shuigeng
    Choi, Byron
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12) : 1861 - 1864
  • [34] Efficient processing of graph similarity search
    Ryan Choi
    Chin-Wan Chung
    World Wide Web, 2015, 18 : 633 - 659
  • [35] Efficient algorithms for supergraph query processing on graph databases
    Shuo Zhang
    Xiaofeng Gao
    Weili Wu
    Jianzhong Li
    Hong Gao
    Journal of Combinatorial Optimization, 2011, 21 : 159 - 191
  • [36] Efficient processing of graph similarity search
    Choi, Ryan
    Chung, Chin-Wan
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2015, 18 (03) : 633 - 659
  • [37] Towards Representation Independent Similarity Search Over Graph Databases
    Chodpathumwan, Yodsawalai
    Aleyasen, Amirhossein
    Termehchy, Arash
    Sun, Yizhou
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016 : 2233 - 2238
  • [38] Efficient algorithms for supergraph query processing on graph databases
    Zhang, Shuo
    Gao, Xiaofeng
    Wu, Weili
    Li, Jianzhong
    Gao, Hong
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2011, 21 (02) : 159 - 191
  • [39] Graph Modeling from Relational Databases
    Lima Filho, Silas P.
    Cavalcanti, Maria C.
    Justel, Claudia M.
    2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017
  • [40] Efficient Search and Localization of Human Actions in Video Databases
    Shao, Ling
    Jones, Simon
    Li, Xuelong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2014, 24 (03) : 504 - 512