Region-aware Top-k Similarity Search

Citations: 1
Authors
Liu, Sitong [1]
Feng, Jianhua [1]
Wu, Yongwei [1]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
Keywords
EFFICIENT;
DOI
10.1007/978-3-319-21042-1_31
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Location-based services have attracted significant attention owing to the ubiquity of smartphones equipped with GPS. These services (e.g., Google Maps, Twitter) generate large amounts of spatio-textual data, which contain both a geographical location and a textual description. Existing location-based services (LBS) assume that the attractiveness of a Point-of-Interest (POI) depends on its spatial proximity to users. However, in most cases, all POIs within a certain distance are acceptable to users, who may care more about other aspects. In this paper, we study the region-aware top-k similarity search problem: given a set of spatio-textual objects, a spatial region, and several input tokens, find the k most textually relevant objects that fall within this region. We summarize our main contributions as follows: (1) We propose a hybrid-landmark index which integrates spatial and textual pruning seamlessly. (2) We explore a priority-based algorithm and extend it to support fuzzy-token distance. (3) We devise a cost model to evaluate landmark quality and propose a deletion-based method to generate high-quality landmarks. (4) Extensive experiments show that our method outperforms state-of-the-art algorithms and achieves high performance.
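The problem statement in the abstract can be illustrated with a minimal brute-force sketch. This is not the hybrid-landmark index or the priority-based algorithm from the paper; it is only a naive baseline that scans every object. It assumes the region is an axis-aligned rectangle and that textual relevance is Jaccard similarity over token sets, neither of which the abstract specifies. All names (SpatioTextualObject, in_region, jaccard, region_topk) are hypothetical.

```python
import heapq
from dataclasses import dataclass


@dataclass(frozen=True)
class SpatioTextualObject:
    """A hypothetical spatio-textual object: a point plus a token set."""
    obj_id: str
    x: float
    y: float
    tokens: frozenset


def in_region(obj, region):
    """Return True if obj lies inside the rectangle (min_x, min_y, max_x, max_y).

    Assumption: the query region is an axis-aligned rectangle, borders included.
    """
    min_x, min_y, max_x, max_y = region
    return min_x <= obj.x <= max_x and min_y <= obj.y <= max_y


def jaccard(a, b):
    """Jaccard similarity of two token sets (assumed relevance measure)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def region_topk(objects, region, query_tokens, k):
    """Naive baseline: filter by region, score every survivor, keep the best k.

    Returns a list of (similarity, obj_id) pairs, highest similarity first.
    """
    query = frozenset(query_tokens)
    scored = (
        (jaccard(obj.tokens, query), obj.obj_id)
        for obj in objects
        if in_region(obj, region)
    )
    return heapq.nlargest(k, scored)


if __name__ == "__main__":
    pois = [
        SpatioTextualObject("a", 1.0, 1.0, frozenset({"coffee", "wifi"})),
        SpatioTextualObject("b", 2.0, 2.0, frozenset({"coffee", "bakery", "wifi"})),
        SpatioTextualObject("c", 9.0, 9.0, frozenset({"coffee", "wifi"})),
        SpatioTextualObject("d", 3.0, 1.0, frozenset({"pizza"})),
    ]
    for score, obj_id in region_topk(pois, (0, 0, 5, 5), ["coffee", "wifi"], k=2):
        print(obj_id, round(score, 3))
```

Distance to the user plays no role in the ranking here: every object inside the region is equally acceptable spatially, and only textual similarity orders the results. The scan costs time linear in the number of objects, which is the cost an index with spatial and textual pruning would aim to avoid.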
Pages: 387-399
Number of pages: 13
Related papers
50 in total
  • [21] Top-k Set Similarity Joins
    Xiao, Chuan
    Wang, Wei
    Lin, Xuemin
    Shang, Haichuan
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 916 - +
  • [22] Efficient top-k similarity document search utilizing distributed file systems and cosine similarity
    Mahmoud Alewiwi
    Cengiz Orencik
    Erkay Savaş
    Cluster Computing, 2016, 19 : 109 - 126
  • [23] Efficient top-k similarity document search utilizing distributed file systems and cosine similarity
    Alewiwi, Mahmoud
    Orencik, Cengiz
    Savas, Erkay
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (01): : 109 - 126
  • [24] Bidirectional String Anchors for Improved Text Indexing and Top-K Similarity Search
    Loukides, Grigorios
    Pissis, Solon P.
    Sweering, Michelle
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11093 - 11111
  • [25] Diversified Top-K Clique Search
    Yuan, Long
    Qin, Lu
    Lin, Xuemin
    Chang, Lijun
    Zhang, Wenjie
    2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 387 - 398
  • [26] Efficient Top-k Search for PageRank
    Fujiwara, Yasuhiro
    Nakatsuji, Makoto
    Shiokawa, Hiroaki
    Mishima, Takeshi
    Onizuka, Makoto
    Transactions of the Japanese Society for Artificial Intelligence, 2015, 30 (02) : 473 - 478
  • [27] Top-k Search in Product Catalogues
    Sumak, Martin
    Gursky, Peter
    DATESO 2011: DATABASES, TEXTS, SPECIFICATIONS, OBJECTS, 2011, 706 : 1 - 12
  • [28] Diversified top-k clique search
    Long Yuan
    Lu Qin
    Xuemin Lin
    Lijun Chang
    Wenjie Zhang
    The VLDB Journal, 2016, 25 : 171 - 196
  • [29] Diversified top-k clique search
    Yuan, Long
    Qin, Lu
    Lin, Xuemin
    Chang, Lijun
    Zhang, Wenjie
    VLDB JOURNAL, 2016, 25 (02): : 171 - 196
  • [30] Interactive Search for One of the Top-k
    Wang, Weicheng
    Wong, Raymond Chi-Wing
    Xie, Min
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 1920 - 1932