Efficient Skyline Keyword-Based Tree Retrieval on Attributed Graphs

被引:0
|
作者
Wu, Dingming [1 ]
Zhang, Zhaofen [1 ]
Jensen, Christian S. [2 ]
Lu, Kezhong [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Aalborg Univ, Dept Comp Sci, DK-9220 Aalborg, Denmark
关键词
Filtering algorithms; Europe; Semantics; Indexes; Social networking (online); Keyword search; Query processing; Skyline query; keyword search; attributed graph; query processing; SEARCH; COMPUTATION; SKYGRAPH;
D O I
10.1109/TKDE.2024.3388988
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Attributed graphs are graphs, where the vertices have attributes. Such graphs encompass, e.g., social network graph, citation graphs, and knowledge graphs, which have numerous real-world applications. Keyword-based search is a prominent and user-friendly way of querying attributed graphs. One widely used approach to keyword search adopts tree-based query semantics that relies on scoring functions that aggregate distances from a root to keyword-matched vertices. However, it is non-trivial to design scoring functions that capture different users' keyword preferences. This study defines and solves the skyline KTree retrieval problem that combines keyword querying with skyline functionality on attributed graphs. The result of a skyline KTree query is independent of scoring functions. Hence, no matter which keywords are preferred, users can always find their favorite KTrees in a result. To enable efficient skyline KTree retrieval, we propose algorithmFilterRefine that first identifies candidate results and then uses them for search space pruning. Computing distances between keywords and vertices is expensive and dominates the computational cost ofFilterRefine. Inspired by subspace skyline query techniques, we convert the skyline KTree retrieval problem into a multi-dimensional subspace skyline problem and propose algorithm MultiDiSkylineOpt. This algorithm is able to reuse skylines in subspaces and uses bounds on all dimensions to accelerate distance computation. Experimental results on real datasets show that a baseline algorithm cannot report results within a 500 second cut-off time, while the proposed algorithms are able to compute results in reasonable time. In particular, MultiDiSkylineOpt is able to efficiently retrieve skyline KTrees on large graphs with millions of nodes and hundreds of millions of edges.
引用
收藏
页码:6056 / 6070
页数:15
相关论文
共 50 条
  • [1] Keyword-Based Betweenness Centrality Maximization in Attributed Graphs
    Wu, Xiao
    Wu, Yanping
    Wang, Xiaoyang
    Yang, Zhengyi
    Zhang, Wenjie
    Zhang, Ying
    DATABASES THEORY AND APPLICATIONS, ADC 2024, 2025, 15449 : 209 - 223
  • [2] Efficient Declustering Techniques for keyword-based Information Retrieval
    Behl, S
    Verma, RM
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2002, : 294 - 300
  • [3] An effective and efficient approach for keyword-based XML retrieval
    Li, XG
    Gong, H
    Wang, DL
    Yu, G
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 56 - 67
  • [4] Keyword-based Vehicle Retrieval
    Park, Eun-Ju
    Kim, Hoyoung
    Jeong, Seonghwan
    Kang, Byungkon
    Kwon, YoungMin
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 4215 - 4222
  • [5] Keyword-based information retrieval for the WoT
    Xylomenos, George
    Zafeiratos, Evangelos
    Prokopakis, Marios
    Proceedings of the 4th ACM/IEEE Symposium on Edge Computing, SEC 2019, 2019, : 407 - 412
  • [6] Keyword-based information retrieval for the WoT
    Xylomenos, George
    Zafeiratos, Evangelos
    Prokopakis, Marios
    SEC'19: PROCEEDINGS OF THE 4TH ACM/IEEE SYMPOSIUM ON EDGE COMPUTING, 2019, : 407 - 412
  • [7] Structural feedback for keyword-based XML retrieval
    Schenkel, Ralf
    Theobald, Martin
    ADVANCES IN INFORMATION RETRIEVAL, 2006, 3936 : 326 - 337
  • [8] An Efficient Scheme of Common Secure Indices for Conjunctive Keyword-Based Retrieval on Encrypted Data
    Wang, Peishun
    Wang, Huaxiong
    Pieprzyk, Josef
    INFORMATION SECURITY APPLICATIONS, 2009, 5379 : 145 - 159
  • [9] Term proximity scoring for keyword-based retrieval systems
    Rasolofo, Y
    Savoy, J
    ADVANCES IN INFORMATION RETRIEVAL, 2003, 2633 : 207 - 218
  • [10] Implementation and Performance Evaluation of Keyword-Based Content Retrieval
    Kurita, Toshihiko
    Suga, Junichi
    Ito, Akira
    2017 NINTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2017), 2017, : 69 - 74