Probabilistic Convex Hull Queries over Uncertain Data

被引:5
|
作者
Yan, Da [1 ]
Zhao, Zhou [1 ]
Ng, Wilfred [1 ]
Liu, Steven [2 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY USA
关键词
Convex hull; uncertain data; Gibbs sampling;
D O I
10.1109/TKDE.2014.2340408
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The convex hull of a set of two-dimensional points, P, is the minimal convex polygon that contains all the points in P. Convex hull is important in many applications such as GIS, statistical analysis and data mining. Due to the ubiquity of data uncertainty such as location uncertainty in real-world applications, we study the concept of convex hull over uncertain data in 2D space. We propose the Probabilistic Convex Hull (PCH) query and demonstrate its applications, such as Flickr landscape photo extraction and activity region visualization, where location uncertainty is incurred by GPS devices or sensors. To tackle the problem of possible world explosion, we develop an O(N-3) algorithm based on geometric properties, where N is the data size. We further improve this algorithm with spatial indices and effective pruning techniques, which prune the majority of data instances. To achieve better time complexity, we propose another O(N-2 log N) algorithm, by maintaining a probability oracle in the form of a circular array with nice properties. Finally, to support applications that require fast response, we develop a Gibbs-sampling-based approximation algorithm which efficiently finds the PCH with high accuracy. Extensive experiments are conducted to verify the efficiency of our algorithms for answering PCH queries.
引用
收藏
页码:852 / 865
页数:14
相关论文
共 50 条
  • [41] Queries over Unstructured Data: Probabilistic Methods to the Rescue (Keynote)
    Sarawagi, Sunita
    ENABLING REAL-TIME BUSINESS INTELLIGENCE, 2010, 41 : 1 - 13
  • [42] Probabilistic skyline queries on uncertain time series
    He, Guoliang
    Chen, Lu
    Zeng, Chen
    Zheng, Qiaoxian
    Zhou, Guofu
    NEUROCOMPUTING, 2016, 191 : 224 - 237
  • [43] Probabilistic inverse ranking queries in uncertain databases
    Lian, Xiang
    Chen, Lei
    VLDB JOURNAL, 2011, 20 (01): : 107 - 127
  • [44] Probabilistic inverse ranking queries in uncertain databases
    Xiang Lian
    Lei Chen
    The VLDB Journal, 2011, 20 : 107 - 127
  • [45] Probabilistic Threshold Join over Distributed Uncertain Data
    Deng, Lei
    Wang, Fei
    Huang, Benxiong
    WEB-AGE INFORMATION MANAGEMENT, 2011, 6897 : 68 - 80
  • [46] Evaluating Probabilistic Spatial-Range Closest Pairs Queries over Uncertain Objects
    Chen, Mo
    Jia, Zixi
    Gu, Yu
    Yu, Ge
    Li, Chuanwen
    WEB-AGE INFORMATION MANAGEMENT, 2011, 6897 : 602 - 613
  • [47] Ontology-mediated Queries over Probabilistic Data via Probabilistic Logic Programming
    van Bremen, Timothy
    Dries, Anton
    Jung, Jean Christoph
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2437 - 2440
  • [48] GDPS: An Efficient Approach for Skyline Queries over Distributed Uncertain Data
    Li, Xiaoyong
    Wang, Yijie
    Li, Xiaoling
    Wang, Xiaowei
    yu, Jie
    BIG DATA RESEARCH, 2014, 1 (01) : 23 - 36
  • [49] Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data
    Ding, Xiaofeng
    Jin, Hai
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (08) : 1448 - 1462
  • [50] Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data
    Ding, Xiaofeng
    Jin, Hai
    2010 INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS ICDCS 2010, 2010,