Tensor index for large scale image retrieval

被引：4

作者：

Zheng, Liang ^{[1
]}

Wang, Shengjin ^{[1
]}

Guo, Peizhen ^{[1
]}

Liang, Hanyue ^{[1
]}

Tian, Qi ^{[2
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

[2] Univ Texas San Antonio, San Antonio, TX 78249 USA

来源：

MULTIMEDIA SYSTEMS | 2015年 / 21卷 / 06期

基金：

国家高技术研究发展计划(863计划); 美国国家科学基金会;

关键词：

Tensor index; Image retrieval; Bag-of-words model; QUANTIZATION; SIMILARITY;

D O I：

10.1007/s00530-014-0415-8

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recently, the bag-of-words representation is widely applied in the image retrieval applications. In this model, visual word is a core component. However, compared with text retrieval, one major problem associated with image retrieval consists in the visual word ambiguity, i.e., a trade-off between precision and recall of visual matching. To address this problem, this paper proposes a tensor index structure to improve precision and recall simultaneously. Essentially, the tensor index is a multi-dimensional index structure. It combines the strengths of two state-of-the-art indexing strategies, i.e., the inverted multi-index [Babenko and Lempitsky (Computer vision and pattern recognition (CVPR), 2012 IEEE Conference, 3069-3076, 2012)] as well as the joint inverted index [Xia et al. (ICCV, 2013)] which are initially designed for approximate nearest neighbor search problems. This paper, instead, exploits their usage in the scenario of image retrieval and provides insights into how to combine them effectively. We show that on the one hand, the multi-index enhances the discriminative power of visual words, thus improving precision; on the other hand, the introduction of multiple codebooks corrects quantization artifacts, thus improving recall. Extensive experiments on two benchmark datasets demonstrate that tensor index significantly improves the baseline approach. Moreover, when incorporating methods such as Hamming embedding, we achieve competitive performances compared to the state-of-the-art ones.

引用

页码：569 / 579

页数：11

共 50 条

[41] Adaptive relevance feedback for large-scale image retrieval
Nicolae Suditu
François Fleuret
Multimedia Tools and Applications, 2016, 75 : 6777 - 6807
[42] Deep semantic preserving hashing for large scale image retrieval
Zareapoor, Masoumeh
Yang, Jie
Jain, Deepak Kumar
Shamsolmoali, Pourya
Jain, Neha
Kant, Surya
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (17) : 23831 - 23846
[43] Deep semantic preserving hashing for large scale image retrieval
Masoumeh Zareapoor
Jie Yang
Deepak Kumar Jain
Pourya Shamsolmoali
Neha Jain
Surya Kant
Multimedia Tools and Applications, 2019, 78 : 23831 - 23846
[44] Large scale document image retrieval by automatic word annotation
K. Pramod Sankar
R. Manmatha
C. V. Jawahar
International Journal on Document Analysis and Recognition (IJDAR), 2014, 17 : 1 - 17
[45] Region similarity arrangement for large-scale image retrieval
Zhang, Dongming
Tang, Jingya
Jin, Guoqing
Zhang, Yongdong
Tian, Qi
NEUROCOMPUTING, 2018, 272 : 461 - 470
[46] Large-Scale Video Retrieval Using Image Queries
Araujo, Andre
Girod, Bernd
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (06) : 1406 - 1420
[47] Large-scale Image Retrieval based on the Vocabulary Tree
Cheng, Bo
Zhuo, Li
Zhang, Pei
Zhang, Jing
PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 299 - 304
[48] Large-Scale Image Retrieval with Compressed Fisher Vectors
Perronnin, Florent
Liu, Yan
Sanchez, Jorge
Poirier, Herve
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3384 - 3391
[49] Multiple hierarchical deep hashing for large scale image retrieval
Liangfu Cao
Lianli Gao
Jingkuan Song
Fumin Shen
Yuan Wang
Multimedia Tools and Applications, 2018, 77 : 10471 - 10484
[50] Cascaded Deep Hashing for Large-Scale Image Retrieval
Lu, Jun
Zhang, Li
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT VI, 2018, 11306 : 419 - 429

← 1 2 3 4 5 →