Large-scale duplicate detection for web image search

被引:42
|
作者
Wang, Bin [1 ]
Li, Zhiwei [2 ]
Li, Mingjing [2 ]
Ma, Wei-Ying [2 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
D O I
10.1109/ICME.2006.262509
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding visually identical images in large image collections is important for many applications such as intelligence propriety protection and search result presentation. Several algorithms have been reported in the literature, but they are not suitable for large image collections. In this paper, a novel algorithm is proposed to handle the situation, in which each image is compactly represented by a hash code. To detect duplicate images, only the hash codes are required. In addition, a very efficient search method is implemented to quickly group images with similar hash codes for fast detection. The experiments show that our algorithm can be both efficient and effective for duplicate detection in web image search.
引用
收藏
页码:353 / +
页数:2
相关论文
共 50 条
  • [21] BertLoc: Duplicate Location Record Detection in a Large-Scale Location Dataset
    Park, Sujin
    Lee, Sangwon
    Woo, Simon S.
    36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 942 - 951
  • [22] Efficient Feature Detection and Effective Post-Verification for Large Scale Near-Duplicate Image Search
    Xie, Hongtao
    Gao, Ke
    Zhang, Yongdong
    Tang, Sheng
    Li, Jintao
    Liu, Yizhi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (06) : 1319 - 1332
  • [23] Duplicate-Search-Based Image Annotation Using Web-Scale Data
    Wang, Xin-Jing
    Zhang, Lei
    Ma, Wei-Ying
    PROCEEDINGS OF THE IEEE, 2012, 100 (09) : 2705 - 2721
  • [24] A systematic study on parameter correlations in large-scale duplicate document detection
    Ye, Shaozhi
    Wen, Ji-Rong
    Ma, Wei-Ying
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 14 (02) : 217 - 232
  • [25] Large-scale near-duplicate image retrieval by kernel density estimation
    Tong, Wei
    Li, Fengjie
    Jin, Rong
    Jain, Anil
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2012, 1 (01) : 45 - 58
  • [26] Large-scale near-duplicate image retrieval by kernel density estimation
    Wei Tong
    Fengjie Li
    Rong Jin
    Anil Jain
    International Journal of Multimedia Information Retrieval, 2012, 1 (1) : 45 - 58
  • [27] Large-scale image search using region division
    Rao, Yunbo
    Liu, Wei
    Pu, Jiansu
    Wang, Zheng
    Wang, Qifei
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2019), 2019, : 326 - 330
  • [28] VisualRank: Applying PageRank to large-scale image search
    Jing, Yushi
    Baluja, Shumeet
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (11) : 1877 - 1890
  • [29] Efficient Large-scale Image Search with a Vocabulary Tree
    Uriza, Esteban
    Gomez-Fernandez, Francisco
    Rais, Martin
    IMAGE PROCESSING ON LINE, 2018, 8 : 71 - 98
  • [30] Real-time, large-scale duplicate image detection method based on multi-feature fusion
    Ming Chen
    Yuhua Li
    Zhifeng Zhang
    Ching-Hsien Hsu
    Shangguang Wang
    Journal of Real-Time Image Processing, 2017, 13 : 557 - 570