Index-based query processing on distributed multidimensional data

被引:6
|
作者
Tsatsanifos, George [1 ]
Sacharidis, Dimitris [2 ]
Sellis, Timos [2 ]
机构
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens, Greece
[2] RC Athena, Inst Management Informat Syst, Athens, Greece
关键词
Distributed index; Distributed query processing; Peer-to-peer networks; Nearest neighbor query; SEARCH;
D O I
10.1007/s10707-012-0163-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work introduces decentralized query processing techniques based on MIDAS, a novel distributed multidimensional index. In particular, MIDAS implements a distributed k-d tree, where leaves correspond to peers, and internal nodes dictate message routing. MIDAS requires that peers maintain little network information, and features mechanisms that support fault tolerance and load balancing. The proposed algorithms process point and range queries over the multidimensional indexed space in only O(log n) hops in expectance, where n is the network size. For nearest neighbor queries, two processing alternatives are discussed. The first, termed eager processing, has low latency (expected value of O(log n) hops) but may involve a large number of peers. The second, termed iterative processing, has higher latency (expected value of O(log(2) n) hops) but involves far fewer peers. A detailed experimental evaluation demonstrates that our query processing techniques outperform existing methods for settings involving real spatial data as well as in the case of high dimensional synthetic data.
引用
收藏
页码:489 / 519
页数:31
相关论文
共 50 条
  • [21] An index-based checkpointing algorithm for autonomous distributed systems
    Baldoni, R
    Quaglia, F
    Fornara, P
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1999, 10 (02) : 181 - 192
  • [22] An index-based checkpointing/recovery approach for distributed systems
    Gupta, B
    Banerjee, SK
    Wang, Z
    COMPUTERS AND THEIR APPLICATIONS, 2001, : 166 - 170
  • [23] An index-based checkpointing algorithm for autonomous distributed systems
    Baldoni, R
    Quaglia, F
    Fornara, P
    SIXTEENTH SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 27 - 34
  • [24] A Term-Based Inverted Index Partitioning Model for Efficient Distributed Query Processing
    Cambazoglu, B. Barla
    Kayaaslan, Enver
    Jonassen, Simon
    Aykanat, Cevdet
    ACM TRANSACTIONS ON THE WEB, 2013, 7 (03)
  • [25] DIM: a distributed air index based on MapReduce for spatial query processing in road networks
    Ran Jin
    Gang Chen
    Anthony K. H. Tung
    Lidan Shou
    Beng Chin Ooi
    Yuting Gu
    EURASIP Journal on Wireless Communications and Networking, 2018
  • [26] DIM: a distributed air index based on MapReduce for spatial query processing in road networks
    Jin, Ran
    Chen, Gang
    Tung, Anthony K. H.
    Shou, Lidan
    Ooi, Beng Chin
    Gu, Yuting
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2018,
  • [27] Monte Carlo Query Processing of Uncertain Multidimensional Array Data
    Ge, Tingjian
    Grabiner, David
    Zdonik, Stan
    IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 936 - 947
  • [28] DistEQ: Distributed Equality Query Processing on Encoded Bitmap Index
    Keawpibal, Naphat
    Duangsuwan, Jarunee
    Wettayaprasit, Wiphada
    Preechaveerakul, Ladda
    Vanichayobon, Sirirut
    PROCEEDINGS OF THE 2015 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2015, : 309 - 314
  • [29] Assessing Multidimensional Vulnerability of Rural Areas to Flooding:An Index-Based Approach
    Samavia Rasool
    Irfan Ahmad Rana
    Hassam Bin Waseem
    International Journal of Disaster Risk Science, 2024, 15 (01) : 88 - 106
  • [30] Efficient OLAP query processing in distributed data warehouses
    Akinde, M
    Böhlen, M
    Johnson, T
    Lakshmanan, LVS
    Srivastava, D
    18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 262 - 262