Efficient batch similarity join processing of social images based on arbitrary features

Authors
Yi Zhuang
Nan Jiang
Zhi-Ang Wu
Jie Cao
Chunhua Ju
Affiliations
[1] Zhejiang Gongshang University, College of Computer and Information Engineering
[2] Hangzhou First People’s Hospital, Jiangsu Provincial Key Laboratory of E
[3] Nanjing University of Finance and Economics, Business
Source
World Wide Web | 2016 / Vol. 19
Keywords
Social image; High-dimensional indexing; Join box; Batch similarity join
DOI
Not available
Abstract
In this paper, we identify and solve a multi-join optimization problem for Arbitrary Feature-based social image Similarity JOINs (AFS-JOIN). Given two collections (i.e., R and S) of social images that carry visual, spatial, and textual (i.e., tag) information, the multiple joins based on arbitrary features retrieve the pairs of images that are visually similar, textually similar, or spatially close from different users. To address this problem, we propose three methods to facilitate multi-join processing: 1) two baseline approaches (i.e., a naïve join approach and a maximal threshold (MT)-based approach), and 2) a Batch Similarity Join (BSJ) method. In the BSJ method, m users’ join requests are first converted and grouped into m″ clusters, which correspond to m″ join boxes, where m > m″. To speed up BSJ processing, the feature distance space is first partitioned into cubes based on four segmentation schemes, and the image pairs falling in the cubes are indexed by a cube tree index. BSJ processing is thus transformed into a search, aided by the index, for the image pairs that fall in the cubes affected by the m″ AFS-JOINs. An extensive experimental evaluation using real and synthetic datasets shows that the proposed BSJ technique outperforms the state-of-the-art solutions.
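The join-box and cube ideas in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: the abstract does not specify the four segmentation schemes or the cube tree index, so a uniform grid stands in for both. The function names, the `cell` parameter, the normalization of distances to [0, 1], and the sample data are all assumptions made for illustration. Each image pair is represented by its (visual, spatial, textual) distance vector, and a join box is one threshold per feature.

```python
from collections import defaultdict
from itertools import product
from math import floor


def within(dist, box):
    """True if every per-feature distance is within the box threshold."""
    return all(x <= t for x, t in zip(dist, box))


def naive_join(pair_dists, box):
    """Baseline: scan every image pair and test it against the join box."""
    return {pair for pair, dist in pair_dists.items() if within(dist, box)}


def build_cube_index(pair_dists, cell):
    """Assumed uniform grid: map each cube (integer coordinates) to its pairs."""
    index = defaultdict(list)
    for pair, dist in pair_dists.items():
        index[tuple(floor(x / cell) for x in dist)].append(pair)
    return index


def cube_join(index, pair_dists, box, cell):
    """Answer one join box by visiting only the cubes that overlap it."""
    ranges = [range(floor(t / cell) + 1) for t in box]
    result = set()
    for cube in product(*ranges):
        for pair in index.get(cube, ()):
            # Cubes on the box boundary may hold pairs outside the box.
            if within(pair_dists[pair], box):
                result.add(pair)
    return result


# Hypothetical distance vectors (visual, spatial, textual) for four pairs.
pair_dists = {
    ("r1", "s1"): (0.10, 0.20, 0.30),
    ("r1", "s2"): (0.60, 0.10, 0.10),
    ("r2", "s1"): (0.20, 0.90, 0.20),
    ("r2", "s2"): (0.05, 0.05, 0.05),
}
box = (0.3, 0.3, 0.5)   # one join box: a threshold per feature
cell = 0.25             # assumed cube edge length

index = build_cube_index(pair_dists, cell)
assert cube_join(index, pair_dists, box, cell) == naive_join(pair_dists, box)
```

In this sketch the index is built once and reused for every join box, which is the property a batch of requests would benefit from. How the paper clusters m requests into m″ boxes is not described in the abstract and is not modeled here.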
Pages: 725-753
Number of pages: 28