Efficient Correlation Search from Graph Databases

Cited by: 12
Authors
Ke, Yiping [1]
Cheng, James [1]
Ng, Wilfred [1]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
Keywords
Correlation; graph databases; Pearson's correlation coefficient
DOI
10.1109/TKDE.2008.86
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose a new problem of correlation mining from graph databases, called Correlated Graph Search (CGS). CGS adopts Pearson's correlation coefficient as the correlation measure to take into account the occurrence distributions of graphs. However, the CGS problem poses significant challenges, since every subgraph of a graph in the database is a candidate, but the number of subgraphs is exponential. We derive two necessary conditions that set bounds on the occurrence probability of a candidate in the database. With this result, we devise an efficient algorithm that mines the candidate set from a much smaller projected database, and thus, we are able to obtain a significantly smaller set of candidates. Three heuristic rules are further developed to refine the candidate set. We also make use of the bounds to directly answer high-support queries without mining the candidates. Our experimental results demonstrate the efficiency of our algorithm. Finally, we show that our algorithm provides a general solution when most of the commonly used correlation measures are used to generalize the CGS problem.
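The abstract names Pearson's correlation coefficient as the measure but does not spell it out. As a minimal sketch, assuming each graph pattern is represented by a binary indicator of whether it occurs in each database graph, the coefficient reduces to the standard phi-coefficient form computed from supports. The function names and the set-of-ids representation below are illustrative assumptions, not taken from the paper.

```python
from math import sqrt


def pearson_from_supports(supp_a, supp_b, supp_ab):
    """Pearson's correlation (phi coefficient) of two binary occurrence indicators.

    supp_a, supp_b: fraction of database graphs containing each pattern.
    supp_ab: fraction of database graphs containing both patterns.
    """
    denom = sqrt(supp_a * (1 - supp_a) * supp_b * (1 - supp_b))
    if denom == 0:
        # A pattern that occurs in no graph or in every graph has zero variance.
        raise ValueError("correlation is undefined when a support is 0 or 1")
    return (supp_ab - supp_a * supp_b) / denom


def occurrence_correlation(occ_a, occ_b, db_size):
    """Correlation of two patterns given the ids of the graphs that contain them.

    occ_a, occ_b: sets of database graph ids (hypothetical representation).
    db_size: total number of graphs in the database.
    """
    return pearson_from_supports(
        len(occ_a) / db_size,
        len(occ_b) / db_size,
        len(occ_a & occ_b) / db_size,
    )
```

Under these assumptions, two patterns that always occur together score 1, patterns that never co-occur while jointly covering the database score -1, and patterns whose joint support equals the product of their supports score 0.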
Pages: 1601 - 1615
Number of pages: 15
Related papers
50 in total
  • [21] Subgraph Isomorphism Search in Massive Graph Databases
    Nabti, Chemseddine
    Seba, Hamida
    IOTBD: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND BIG DATA, 2016, : 204 - 213
  • [22] An Efficient Search Algorithm for Fingerprint Databases
    Cha, Guang-Ho
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2019, 35 (02) : 471 - 484
  • [23] Efficient search and retrieval in biometric databases
    Mhatre, A
    Palla, S
    Chikkerur, S
    Govindaraju, V
    Biometric Technology for Human Identification II, 2005, 5779 : 265 - 273
  • [24] Graph Similarity Search with Edit Distance Constraint in Large Graph Databases
    Zheng, Weiguo
    Zou, Lei
    Lian, Xiang
    Wang, Dong
    Zhao, Dongyan
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1595 - 1600
  • [25] GHashing: Semantic Graph Hashing for Approximate Similarity Search in Graph Databases
    Qin, Zongyue
    Bai, Yunsheng
    Sun, Yizhou
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 2062 - 2072
  • [26] Efficient mining of minimal distinguishing subgraph patterns from graph databases
    Zeng, Zhiping
    Wang, Jianyong
    Zhou, Lizhu
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 1062 - 1068
  • [27] Efficient query processing on uncertain graph databases
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
    Jisuanji Xuebao, 2009, 10 : 2066 - 2079
  • [28] Compact and efficient representation of general graph databases
    Alvarez-Garcia, Sandra
    Freire, Borja
    Ladra, Susana
    Pedreira, Oscar
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (03) : 1479 - 1510
  • [29] Compact and efficient representation of general graph databases
    Sandra Álvarez-García
    Borja Freire
    Susana Ladra
    Óscar Pedreira
    Knowledge and Information Systems, 2019, 60 : 1479 - 1510
  • [30] Geometric Graph Indexing for Similarity Search in Scientific Databases
    Armiti, Ayser
    Gertz, Michael
    28TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2016), 2016