Image Similarity Search in Large Databases Using a Fast Machine Learning Approach

Cited by: 0
Authors
Sinjur, Smiljan [1]
Zazula, Damjan [1]
Affiliations
[1] Univ Maribor, Fac Elect Engn & Comp Sci, SLO-2000 Maribor, Slovenia
Keywords
Image similarity; Convex layer; Correlation coefficient; Machine learning; Support vector machine;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The current drive to protect copyrighted multimedia content, such as text, images, or video, has produced many algorithms for detecting duplicates. If the observed content is identical, the task is easy. But if the content is changed even slightly, identifying the duplicate can be difficult and time-consuming. In this paper we develop a fast, two-step algorithm for detecting image duplicates. The algorithm also finds slightly changed images: images with added noise, with translated or scaled content, or images that have been compressed and decompressed by various algorithms. The time needed to detect duplicates is kept low by implementing image feature-based searches. To detect all images similar to a given reference image, feature extraction based on convex layers is deployed. The correlation coefficient between two features gives the user a first hint of similarity; the user then creates a learning set for support vector machines by simple on-screen selection.
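The first step described in the abstract, scoring candidates by the correlation coefficient between two feature vectors, can be illustrated with a minimal Python sketch. It assumes each image has already been reduced to a fixed-length numeric feature vector; the abstract does not say how the convex-layer features are encoded, so the vectors, the function names `pearson_correlation` and `rank_candidates`, and the choice of the Pearson coefficient are illustrative assumptions, not the authors' implementation.

```python
import math


def pearson_correlation(x, y):
    """Pearson correlation coefficient between two equal-length vectors.

    Assumption: the record does not name the coefficient used in the
    paper; Pearson's r is used here as the most common choice.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("vectors must have the same length (>= 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    covariance = sum(a * b for a, b in zip(dx, dy))
    norm = math.sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    if norm == 0.0:
        # A constant vector has no defined correlation; treat as "no hint".
        return 0.0
    return covariance / norm


def rank_candidates(reference, candidates):
    """Sort candidate images by correlation with the reference, best first.

    `candidates` maps a hypothetical image name to its feature vector.
    """
    scored = [
        (name, pearson_correlation(reference, features))
        for name, features in candidates.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

In a workflow like the one the abstract outlines, the highest-ranked candidates would be shown to the user, whose on-screen selections then label the learning set for the support vector machine.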
Pages: 85-93
Number of pages: 9
Related papers
50 in total
  • [41] Efficient Graph Similarity Search Over Large Graph Databases
    Zheng, Weiguo
    Zou, Lei
    Lian, Xiang
    Wang, Dong
    Zhao, Dongyan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (04) : 964 - 978
  • [42] Efficient similarity search in large databases of tree structured objects
    Kailing, K
    Kriegel, HP
    Schönauer, S
    Seidl, T
    20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 835 - 835
  • [43] Efficient Subgraph Similarity Search on Large Probabilistic Graph Databases
    Yuan, Ye
    Wang, Guoren
    Chen, Lei
    Wang, Haixun
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (09) : 800 - 811
  • [44] Effective indexing and filtering for similarity search in large biosequence databases
    Ozturk, O
    Ferhatosmanoglu, H
    THIRD IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING - BIBE 2003, PROCEEDINGS, 2003, : 359 - 366
  • [45] MidiFind: Similarity Search and Popularity Mining in Large MIDI Databases
    Xia, Guangyu
    Huang, Tongbo
    Ma, Yifei
    Dannenberg, Roger
    Faloutsos, Christos
    SOUND, MUSIC, AND MOTION, 2014, 8905 : 259 - 276
  • [46] A multi-step approach for partial similarity search in large image data using histogram intersection
    Kim, CR
    Chung, CW
    INFORMATION AND SOFTWARE TECHNOLOGY, 2003, 45 (04) : 203 - 215
  • [47] Learning feature relevance and similarity metrics in image databases
    Bhanu, B
    Peng, J
    Qing, S
    IEEE WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO LIBRARIES - PROCEEDINGS, 1998, : 14 - 18
  • [48] SSAHA: A fast search method for large DNA databases
    Ning, ZM
    Cox, AJ
    Mullikin, JC
    GENOME RESEARCH, 2001, 11 (10) : 1725 - 1729
  • [49] Fast similarity search in the presence of longitudinal scaling in time series databases
    Keogh, E
    NINTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1997, : 578 - 584
  • [50] BTS: a fast approach for similarity search in sequences
    Jin, Bi
    Rong, Gang
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 5933 - +