Efficient Skyline Keyword-Based Tree Retrieval on Attributed Graphs

Cited by: 0
Authors
Wu, Dingming [1]
Zhang, Zhaofen [1]
Jensen, Christian S. [2]
Lu, Kezhong [1]
Affiliations
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Aalborg Univ, Dept Comp Sci, DK-9220 Aalborg, Denmark
Keywords
Filtering algorithms; Europe; Semantics; Indexes; Social networking (online); Keyword search; Query processing; Skyline query; keyword search; attributed graph; query processing; SEARCH; COMPUTATION; SKYGRAPH
DOI
10.1109/TKDE.2024.3388988
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Attributed graphs are graphs whose vertices have attributes. Such graphs include, e.g., social network graphs, citation graphs, and knowledge graphs, which have numerous real-world applications. Keyword-based search is a prominent and user-friendly way of querying attributed graphs. One widely used approach to keyword search adopts tree-based query semantics that rely on scoring functions that aggregate distances from a root to keyword-matched vertices. However, it is non-trivial to design scoring functions that capture different users' keyword preferences. This study defines and solves the skyline KTree retrieval problem, which combines keyword querying with skyline functionality on attributed graphs. The result of a skyline KTree query is independent of scoring functions. Hence, no matter which keywords are preferred, users can always find their favorite KTrees in the result. To enable efficient skyline KTree retrieval, we propose algorithm FilterRefine, which first identifies candidate results and then uses them for search space pruning. Computing distances between keywords and vertices is expensive and dominates the computational cost of FilterRefine. Inspired by subspace skyline query techniques, we convert the skyline KTree retrieval problem into a multi-dimensional subspace skyline problem and propose algorithm MultiDiSkylineOpt. This algorithm is able to reuse skylines in subspaces and uses bounds on all dimensions to accelerate distance computation. Experimental results on real datasets show that a baseline algorithm cannot report results within a 500-second cut-off time, while the proposed algorithms are able to compute results in reasonable time. In particular, MultiDiSkylineOpt is able to efficiently retrieve skyline KTrees on large graphs with millions of nodes and hundreds of millions of edges.
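The skyline idea in the abstract can be illustrated with a small sketch. This is not FilterRefine or MultiDiSkylineOpt, whose details the abstract does not give; it is a naive quadratic baseline under stated assumptions. The assumptions are: the graph is undirected and unweighted, distance is hop count, and each candidate root is summarized by its vector of distances to the nearest vertex matching each query keyword. All function and variable names (`keyword_distances`, `dominates`, `skyline_roots`, `adj`, `attrs`) are hypothetical.

```python
from collections import deque


def keyword_distances(adj, attrs, keyword):
    """Multi-source BFS: hop distance from each vertex to the nearest
    vertex whose attribute set contains `keyword` (assumed semantics)."""
    dist = {v: 0 for v in adj if keyword in attrs.get(v, ())}
    queue = deque(dist)
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    return dist


def dominates(a, b):
    """a dominates b if a is no worse in every dimension and strictly
    better in at least one (smaller distance is better)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def skyline_roots(adj, attrs, keywords):
    """Return the roots whose distance vectors are not dominated by any other root."""
    per_kw = [keyword_distances(adj, attrs, k) for k in keywords]
    # Keep only roots that can reach every keyword.
    vectors = {v: tuple(d[v] for d in per_kw)
               for v in adj if all(v in d for d in per_kw)}
    return {v: vec for v, vec in vectors.items()
            if not any(dominates(other, vec) for other in vectors.values())}


# Toy graph: a path a-b-c-d-e with an extra leaf f attached to c.
adj = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d", "f"],
       "d": ["c", "e"], "e": ["d"], "f": ["c"]}
attrs = {"a": {"db"}, "e": {"ml"}}
result = skyline_roots(adj, attrs, ["db", "ml"])
```

In this toy graph, root `f` has vector (3, 3) and is dominated by `c` with (2, 2), so it is excluded, while `a` through `e` trade off one keyword against the other and all remain in the skyline. This mirrors the claim that the result does not depend on a scoring function: under these assumptions, any monotone preference over the keywords is minimized by some root in the skyline.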
Pages: 6056-6070
Page count: 15