Large-scale duplicate detection for web image search

被引:42
|
作者
Wang, Bin [1 ]
Li, Zhiwei [2 ]
Li, Mingjing [2 ]
Ma, Wei-Ying [2 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
D O I
10.1109/ICME.2006.262509
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding visually identical images in large image collections is important for many applications such as intelligence propriety protection and search result presentation. Several algorithms have been reported in the literature, but they are not suitable for large image collections. In this paper, a novel algorithm is proposed to handle the situation, in which each image is compactly represented by a hash code. To detect duplicate images, only the hash codes are required. In addition, a very efficient search method is implemented to quickly group images with similar hash codes for fast detection. The experiments show that our algorithm can be both efficient and effective for duplicate detection in web image search.
引用
收藏
页码:353 / +
页数:2
相关论文
共 50 条
  • [31] Real-time, large-scale duplicate image detection method based on multi-feature fusion
    Chen, Ming
    Li, Yuhua
    Zhang, Zhifeng
    Hsu, Ching-Hsien
    Wang, Shangguang
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2017, 13 (03) : 557 - 570
  • [32] A query-dependent duplicate detection approach for large scale search engines
    Ye, SZ
    Song, RH
    Wen, JR
    Ma, WY
    ADVANCED WEB TECHNOLOGIES AND APPLICATIONS, 2004, 3007 : 48 - 58
  • [33] Fast and robust duplicate image detection on the web
    Gadeski, Etienne
    Le Borgne, Herve
    Popescu, Adrian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (09) : 11839 - 11858
  • [34] Fast and robust duplicate image detection on the web
    Etienne Gadeski
    Hervé Le Borgne
    Adrian Popescu
    Multimedia Tools and Applications, 2017, 76 : 11839 - 11858
  • [35] Scalability and Efficiency Challenges in Large-Scale Web Search Engines
    Baeza-Yates, Ricardo
    Cambazoglu, B. Barla
    WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 185 - 186
  • [36] A Query Construction Service for large-scale Web Search Engines
    Papadakis, Ioannis
    Stefanidakis, Michalis
    Stamou, Sofia
    Andreou, Ioannis
    2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 96 - +
  • [37] Scalability and Efficiency Challenges in Large-Scale Web Search Engines
    Barla Cambazoglu, B.
    Baeza-Yates, Ricardo
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 1285 - 1285
  • [38] Scalability and Efficiency Challenges in Large-Scale Web Search Engines
    Barla Cambazoglu, B.
    Baeza-Yates, Ricardo
    WSDM'15: PROCEEDINGS OF THE EIGHTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2015, : 411 - 412
  • [39] A hierarchical cache scheme for the large-scale web search engine
    Lim, Sungchae
    Ahn, Joonseon
    PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2008, : 925 - +
  • [40] Scalability and Efficiency Challenges in Large-Scale Web Search Engines
    Barla Cambazoglu, B.
    Baeza-Yates, Ricardo
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 1223 - 1226