Large-scale duplicate detection for web image search

被引:42
|
作者
Wang, Bin [1 ]
Li, Zhiwei [2 ]
Li, Mingjing [2 ]
Ma, Wei-Ying [2 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
D O I
10.1109/ICME.2006.262509
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding visually identical images in large image collections is important for many applications such as intelligence propriety protection and search result presentation. Several algorithms have been reported in the literature, but they are not suitable for large image collections. In this paper, a novel algorithm is proposed to handle the situation, in which each image is compactly represented by a hash code. To detect duplicate images, only the hash codes are required. In addition, a very efficient search method is implemented to quickly group images with similar hash codes for fast detection. The experiments show that our algorithm can be both efficient and effective for duplicate detection in web image search.
引用
收藏
页码:353 / +
页数:2
相关论文
共 50 条
  • [41] Semantifying queries over large-scale Web search engines
    Papadakis, Ioannis
    Stefanidakis, Michalis
    Stamou, Sofia
    Andreou, Ioannis
    JOURNAL OF INTERNET SERVICES AND APPLICATIONS, 2012, 3 (03) : 255 - 268
  • [42] A Framework for Large-Scale Detection of Web Site Defacements
    Bartoli, Alberto
    Davanzo, Giorgio
    Medvet, Eric
    ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2010, 10 (03)
  • [43] Sub-Selective Quantization for Large-Scale Image Search
    Li, Yeqing
    Chen, Chen
    Liu, Wei
    Huang, Junzhou
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2803 - 2809
  • [44] Erratum to: Real-time, large-scale duplicate image detection method based on multi-feature fusion
    Ming Chen
    Yuhua Li
    Zhifeng Zhang
    Ching-Hsien Hsu
    Shangguang Wang
    Journal of Real-Time Image Processing, 2019, 16 : 1881 - 1881
  • [45] Simultaneous Feature Aggregating and Hashing for Large-scale Image Search
    Thanh-Toan Do
    Dang-Khoa Le Tan
    Pham, Trung T.
    Cheung, Ngai-Man
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4217 - 4226
  • [46] Large-scale image and video search: Challenges, technologies, and trends
    Wang, Meng
    Sebe, Nicu
    Mei, Tao
    Li, Jia
    Aizawa, Kiyoharu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2010, 21 (08) : 771 - 772
  • [47] Image Models for large-scale Object Detection and Classification
    Kralev, Jordan
    Koeva, Svetla
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE COMPUTATIONAL LINGUISTICS IN BULGARIA, CLIB 2022, 2022, : 190 - 201
  • [48] Parallel AP Clustering and Re-ranking for Automatic Image-Text Alignment and Large-Scale Web Image Search
    Qu, Yanyun
    Zhang, Baopeng
    Fan, Jianping
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 451 - 454
  • [49] Duplicate image detection in a stream of web visual data
    Gadeski, Etienne
    Le Borgne, Herve
    Popescu, Adrian
    2015 13TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2015,
  • [50] Multiple Distance-Based Coding: Toward Scalable Feature Matching for Large-Scale Web Image Search
    Zhou, Zhili
    Wu, Q. M. Jonathan
    Sun, Xingming
    IEEE TRANSACTIONS ON BIG DATA, 2021, 7 (03) : 559 - 573