Image Similarity Search in Large Databases Using a Fast Machine Learning Approach

被引：0

作者：

Sinjur, Smiljan ^{[1
]}

Zazula, Damjan ^{[1
]}

机构：

[1] Univ Maribor, Fac Elect Engn & Comp Sci, SLO-2000 Maribor, Slovenia

来源：

NEW DIRECTIONS IN INTELLIGENT INTERACTIVE MULTIMEDIA | 2008年 / 142卷

关键词：

Image similarity; Convex layer; Correlation coefficient; Machine learning; Support vector machine;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Today's tendency to protect various copyrighted multimedia contents, such as text, images or video, resulted in many algorithms for detecting duplicates. If the observed content is identical, then the task is easy. But if the content is even slightly changed, the task to identify the duplicate can be difficult and time consuming. In this paper we develop a fast, two-step algorithm for detecting image duplicates. The algorithm finds also slightly changed images with added noise, translated or scaled content, or images having been compressed and decompressed by various algorithms. The time needed to detect duplicates is kept low by implementing image feature-based searches. To detect all similar images for a given reference image, the feature extraction based on convex layers is deployed. The correlation coefficient between two features gives the first hint of similarity to the user, who creates a learning set for support vector machines by simple on-screen selection.

引用

页码：85 / 93

页数：9

共 50 条

[41] Efficient Graph Similarity Search Over Large Graph Databases
Zheng, Weiguo
Zou, Lei
Lian, Xiang
Wang, Dong
Zhao, Dongyan
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (04) : 964 - 978
[42] Efficient similarity search in large databases of tree structured objects
Kailing, K
Kriegel, HP
Schönauer, S
Seidl, T
20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 835 - 835
[43] Efficient Subgraph Similarity Search on Large Probabilistic Graph Databases
Yuan, Ye
Wang, Guoren
Chent, Lei
Wang, Haixun
PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (09): : 800 - 811
[44] Effective indexing and filtering for similarity search in large biosequence databases
Ozturk, O
Ferhatosmanoglu, H
THIRD IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING - BIBE 2003, PROCEEDINGS, 2003, : 359 - 366
[45] MidiFind: Similarity Search and Popularity Mining in Large MIDI Databases
Xia, Guangyu
Huang, Tongbo
Ma, Yifei
Dannenberg, Roger
Faloutsos, Christos
SOUND, MUSIC, AND MOTION, 2014, 8905 : 259 - 276
[46] A multi-step approach for partial similarity search in large image data using histogram intersection
Kim, CR
Chung, CW
INFORMATION AND SOFTWARE TECHNOLOGY, 2003, 45 (04) : 203 - 215
[47] Learning feature relevance and similarity metrics in image databases
Bhanu, B
Peng, J
Qing, S
IEEE WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO LIBRARIES - PROCEEDINGS, 1998, : 14 - 18
[48] SSAHA: A fast search method for large DNA databases
Ning, ZM
Cox, AJ
Mullikin, JC
GENOME RESEARCH, 2001, 11 (10) : 1725 - 1729
[49] Fast similarity search in the presence of longitudinal scaling in time series databases
Keogh, E
NINTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1997, : 578 - 584
[50] BTS: a fast approach for similarity search in sequences
Jin, Bi
Rong, Gang
WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 5933 - +

← 1 2 3 4 5 →