Probabilistic Convex Hull Queries over Uncertain Data

被引:5
|
作者
Yan, Da [1 ]
Zhao, Zhou [1 ]
Ng, Wilfred [1 ]
Liu, Steven [2 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY USA
关键词
Convex hull; uncertain data; Gibbs sampling;
D O I
10.1109/TKDE.2014.2340408
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The convex hull of a set of two-dimensional points, P, is the minimal convex polygon that contains all the points in P. Convex hull is important in many applications such as GIS, statistical analysis and data mining. Due to the ubiquity of data uncertainty such as location uncertainty in real-world applications, we study the concept of convex hull over uncertain data in 2D space. We propose the Probabilistic Convex Hull (PCH) query and demonstrate its applications, such as Flickr landscape photo extraction and activity region visualization, where location uncertainty is incurred by GPS devices or sensors. To tackle the problem of possible world explosion, we develop an O(N-3) algorithm based on geometric properties, where N is the data size. We further improve this algorithm with spatial indices and effective pruning techniques, which prune the majority of data instances. To achieve better time complexity, we propose another O(N-2 log N) algorithm, by maintaining a probability oracle in the form of a circular array with nice properties. Finally, to support applications that require fast response, we develop a Gibbs-sampling-based approximation algorithm which efficiently finds the PCH with high accuracy. Extensive experiments are conducted to verify the efficiency of our algorithms for answering PCH queries.
引用
收藏
页码:852 / 865
页数:14
相关论文
共 50 条
  • [31] Distributed probabilistic top-k dominating queries over uncertain databases
    Niranjan Rai
    Xiang Lian
    Knowledge and Information Systems, 2023, 65 : 4939 - 4965
  • [32] On Generalized Planar Skyline and Convex Hull Range Queries
    Moidu, Nadeem
    Agarwal, Jatin
    Khare, Sankalp
    Kothapalli, Kishore
    Srinathan, Kannan
    ALGORITHMS AND COMPUTATION, WALCOM 2014, 2014, 8344 : 34 - 43
  • [33] Adaptive Processing for Distributed Skyline Queries over Uncertain Data
    Zhou, Xu
    Li, Kenli
    Zhou, Yantao
    Li, Keqin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (02) : 371 - 384
  • [34] Visible k nearest neighbor queries over uncertain data
    Wang Y.-Q.
    Xu C.-F.
    Yu G.
    Gu Y.
    Chen M.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (10): : 1943 - 1952
  • [35] On the Most Likely Convex Hull of Uncertain Points
    Suri, Subhash
    Verbeek, Kevin
    Yildiz, Hakan
    ALGORITHMS - ESA 2013, 2013, 8125 : 791 - 802
  • [36] Human-Powered Data Cleaning for Probabilistic Reachability Queries on Uncertain Graphs
    Lin, Xin
    Peng, Yun
    Xu, Jianliang
    Choi, Byron
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1755 - 1756
  • [37] Human-Powered Data Cleaning for Probabilistic Reachability Queries on Uncertain Graphs
    Lin, Xin
    Peng, Yun
    Choi, Byron
    Xu, Jianliang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (07) : 1452 - 1465
  • [38] Sliding-Window Probabilistic Threshold Aggregate Queries on Uncertain Data Streams
    Chen, Donghui
    Chen, Ling
    INFORMATION SCIENCES, 2020, 520 (520) : 353 - 372
  • [39] Efficiently answering probabilistic threshold top-k queries on uncertain data
    Hua, Ming
    Pei, Jian
    Zhang, Wenjie
    Lin, Xuemin
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1403 - +
  • [40] Evaluating Continuous Probabilistic Queries Over Imprecise Sensor Data
    Zhang, Yinuo
    Cheng, Reynold
    Chen, Jinchuan
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, PROCEEDINGS, 2010, 5981 : 535 - +