Image Similarity Search in Large Databases Using a Fast Machine Learning Approach

被引:0
|
作者
Sinjur, Smiljan [1 ]
Zazula, Damjan [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, SLO-2000 Maribor, Slovenia
关键词
Image similarity; Convex layer; Correlation coefficient; Machine learning; Support vector machine;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Today's tendency to protect various copyrighted multimedia contents, such as text, images or video, resulted in many algorithms for detecting duplicates. If the observed content is identical, then the task is easy. But if the content is even slightly changed, the task to identify the duplicate can be difficult and time consuming. In this paper we develop a fast, two-step algorithm for detecting image duplicates. The algorithm finds also slightly changed images with added noise, translated or scaled content, or images having been compressed and decompressed by various algorithms. The time needed to detect duplicates is kept low by implementing image feature-based searches. To detect all similar images for a given reference image, the feature extraction based on convex layers is deployed. The correlation coefficient between two features gives the first hint of similarity to the user, who creates a learning set for support vector machines by simple on-screen selection.
引用
收藏
页码:85 / 93
页数:9
相关论文
共 50 条
  • [1] Adaptable similarity search in large image databases
    Seidl, T
    Kriegel, HP
    STATE-OF-THE-ART IN CONTENT-BASED IMAGE AND VIDEO RETRIEVAL, 2001, 22 : 297 - 317
  • [2] A multistep approach for shape similarity search in image databases
    Ankerst, M
    Kriegel, HP
    Seidl, T
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1998, 10 (06) : 996 - 1004
  • [3] A fast heuristic algorithm for similarity search in large DNA databases
    Jeong, In-Seon
    Park, Kyoung-Wook
    Lim, Hyeong-Seok
    PROCEEDINGS OF THE FRONTIERS IN THE CONVERGENCE OF BIOSCIENCE AND INFORMATION TECHNOLOGIES, 2007, : 335 - 340
  • [4] A hybrid approach to machine learning annotation of large galaxy image databases
    Kuminski, E.
    Shamir, L.
    ASTRONOMY AND COMPUTING, 2018, 25 : 257 - 269
  • [5] An adaptive index structure for similarity search in large image databases
    Wu, P
    Manjunath, BS
    INTERNET MULTIMEDIA MANAGEMENT SYSTEMS II, 2001, 4519 : 32 - 41
  • [6] Fast similarity search in string databases
    Sheu, S
    Chang, A
    Huang, W
    19TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 1, PROCEEDINGS: AINA 2005, 2005, : 617 - 622
  • [7] Indexing scheme for fast similarity search in large time series databases
    Keogh, Eamonn J.
    Pazzani, Michael J.
    Proceedings of the International Conference on Scientific and Statistical Database Management, SSDBM, 1999, : 56 - 67
  • [8] Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases
    Eamonn Keogh
    Kaushik Chakrabarti
    Michael Pazzani
    Sharad Mehrotra
    Knowledge and Information Systems, 2001, 3 (3) : 263 - 286
  • [9] Multiresolution similarity search in image databases
    Heczko, M
    Hinneburg, A
    Keim, D
    Wawryniuk, M
    MULTIMEDIA SYSTEMS, 2004, 10 (01) : 28 - 40
  • [10] Multiresolution similarity search in image databases
    Martin Heczko
    Alexander Hinneburg
    Daniel Keim
    Markus Wawryniuk
    Multimedia Systems, 2004, 10 : 28 - 40