Optimizing path query performance: graph clustering strategies

被引:2
|
作者
Huang, YW
Jing, N
Rundensteiner, EA
机构
[1] IBM Corp, TJ Watson Res Ctr, Hawthorne, NY 10532 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
path query processing; transportation networks; spatial clustering; clustering optimization; geographic information systems;
D O I
10.1016/S0968-090X(00)00049-8
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Path queries over transportation networks are operations required by many Geographic Information Systems applications. Such networks, typically modeled as graphs composed of nodes and links and represented as link relations, can be very large and hence often need to be stored on secondary storage devices. Path query computation over such large persistent networks amounts to high I/O costs due to having to repeatedly bring in links from the link relation from secondary storage into the main memory buffer for processing. This paper is the first to present a comparative experimental evaluation of alternative graph clustering solutions in order to show their effectiveness in path query processing over transportation networks. Clustering optimization is attractive because it does not incur any run-time cost, requires no auxiliary data structures, and is complimentary to many of the existing solutions on path query processing. In this payer, we develop a novel clustering technique, called spatial partition clustering (SPC), that exploits unique properties of transportation networks such as spatial coordinates and high locality. We identify other promising candidates for clustering optimizations from the literature? such as two-way partitioning and approximate topological clustering. We fine-tune them to optimize their I/O behavior for path query processing. Our experimental evaluation of the performance of these graph clustering techniques using an actual city road network as well as randomly generated graphs considers variations in parameters such as memory buffer size, length of the paths, locality, and out-degree. Our experimental results are the foundation for establishing guidelines to select the best clustering technique based on the type of networks. We rind that our SPC performs the best for the highly interconnected city map; the hybrid approach for random graphs with high locality: and the two-way partitioning based on link weights for random graphs with no locality. (C) 2000 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:381 / 408
页数:28
相关论文
共 50 条
  • [1] Query-oriented Graph Clustering
    Kuo, Li-Yen
    Chou, Chung-Kuang
    Chen, Ming-Syan
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT II, 2017, 10235 : 749 - 761
  • [2] KBQA: Accelerate Fuzzy Path Query on Knowledge Graph
    Zeng, Li
    You, Qiheng
    Lu, Jincheng
    Liu, Shizheng
    Sun, Weijian
    Zhao, Rongqian
    Chen, Xin
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2023, PT I, 2023, 14146 : 462 - 477
  • [3] Mixed Membership Graph Clustering via Systematic Edge Query
    Ibrahim, Shahana
    Fu, Xiao
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 : 5189 - 5205
  • [4] Comparative evaluation of region query strategies for DBSCAN clustering
    Galan, Severino F.
    INFORMATION SCIENCES, 2019, 502 : 76 - 90
  • [5] Graph-Based Query Strategies for Active Learning
    Wu, Wei
    Ostendorf, Mari
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (02): : 260 - 269
  • [6] Autonomic care platform for optimizing query performance
    Steurbaut, Kristof
    Latre, Steven
    Decruyenaere, Johan
    De Turck, Filip
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2013, 13
  • [7] Autonomic care platform for optimizing query performance
    Kristof Steurbaut
    Steven Latré
    Johan Decruyenaere
    Filip De Turck
    BMC Medical Informatics and Decision Making, 13
  • [8] Integrated query processing strategies for spatial path queries
    Huang, YW
    Jing, N
    Rundensteiner, EA
    13TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING - PROCEEDINGS, 1997, : 477 - 486
  • [9] Optimizing the Critical Path of Distributed Dataflow Graph Algorithms
    Durrman, Dante
    Saule, Erik
    2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW, 2023, : 898 - 904
  • [10] Optimizing regular path expressions using graph schemas
    Fernandez, M
    Suciu, D
    14TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1998, : 14 - 23