Scalable Discrete Matrix Factorization and Semantic Autoencoder for Cross-Media Retrieval

Cited by: 27

Authors:

Zhang, Donglin [1,2]

Wu, Xiao-Jun [1,2]

Affiliations:

[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China

[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China

Source:

IEEE TRANSACTIONS ON CYBERNETICS | 2022 / Vol. 52 / No. 07

Funding:

National Natural Science Foundation of China;

Keywords:

Autoencoder; cross-modal retrieval; hashing; NEAREST-NEIGHBOR; BINARY-CODES; IMAGE SEARCH; QUANTIZATION;

DOI:

10.1109/TCYB.2020.3032017

Chinese Library Classification:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

Hashing methods have attracted great attention in multimedia tasks due to their effectiveness and efficiency. However, most existing methods generate binary codes by relaxing the binary constraints, which may cause large quantization error. In addition, most supervised cross-modal approaches preserve the similarity relationship by constructing a large n × n similarity matrix, which requires heavy computation and makes these methods unscalable. To address these challenges, this article presents a novel algorithm called the scalable discrete matrix factorization and semantic autoencoder method (SDMSA). SDMSA is a two-stage method. In the first stage, a matrix factorization scheme is used to learn the latent semantic information, and the label matrix is incorporated into the loss function instead of the similarity matrix. The binary codes can then be generated from the latent representations. During optimization, this avoids manipulating a large n × n similarity matrix, and the hash codes can be generated directly. In the second stage, a novel hash function learning scheme based on the autoencoder is proposed. The encoder-decoder paradigm aims to learn projections: the encoder projects feature vectors to code vectors, and the decoder projects the code vectors back to the original feature vectors. This encoder-decoder scheme ensures that the embedding preserves both the semantic and feature information well. Specifically, two algorithms, SDMSA-lin and SDMSA-ker, are developed under the SDMSA framework. Owing to the merits of SDMSA, more semantically meaningful binary hash codes can be obtained. Extensive experiments on several databases show that SDMSA-lin and SDMSA-ker achieve promising performance.
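
The second-stage idea in the abstract (an encoder that maps features to codes and a decoder that maps codes back to features) can be illustrated with a minimal sketch. This is not the SDMSA algorithm itself, whose objective and optimization are not given in this record. It assumes a linear encoder W with a tied decoder W^T, a hypothetical trade-off weight `lam`, and the least-squares objective ||X - W^T B||^2 + lam * ||W X - B||^2, whose stationarity condition is the Sylvester equation (B B^T) W + W (lam X X^T) = (1 + lam) B X^T. The function names and the synthetic data are illustrative assumptions only.

```python
import numpy as np


def fit_tied_autoencoder(X, B, lam=1.0):
    """Fit a linear encoder W (r x d) with tied decoder W.T.

    X: d x n feature matrix, B: r x n code matrix with entries in {-1, +1}.
    Assumed objective: ||X - W.T @ B||^2 + lam * ||W @ X - B||^2.
    """
    r, d = B.shape[0], X.shape[0]
    A = B @ B.T                    # r x r
    C = lam * (X @ X.T)            # d x d
    Q = (1.0 + lam) * (B @ X.T)    # r x d
    # Solve A W + W C = Q using column-major vectorization:
    # vec(A W) = (I_d kron A) vec(W), vec(W C) = (C.T kron I_r) vec(W).
    K = np.kron(np.eye(d), A) + np.kron(C.T, np.eye(r))
    w = np.linalg.solve(K, Q.flatten(order="F"))
    return w.reshape((r, d), order="F")


def encode(W, X):
    """Hash function: project features and binarize to {-1, +1}."""
    return np.where(W @ X >= 0, 1, -1)


# Synthetic data standing in for features and first-stage codes.
rng = np.random.default_rng(0)
d, r, n = 12, 4, 200
X = rng.standard_normal((d, n))
B = np.where(rng.standard_normal((r, d)) @ X >= 0, 1, -1)

W = fit_tied_autoencoder(X, B, lam=1.0)
residual = np.linalg.norm(B @ B.T @ W + W @ (X @ X.T) - 2.0 * (B @ X.T))
agreement = float(np.mean(encode(W, X) == B))
print("Sylvester residual:", residual)
print("fraction of bits reproduced by the encoder:", agreement)
```

The Kronecker-product solve is only practical for small r and d; for realistic sizes a dedicated Sylvester solver would be the more sensible choice.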


Pages: 5947 - 5960

Page count: 14

Related documents: 50 in total

[1] Discrete Semantic Alignment Hashing for Cross-Media Retrieval
Yao, Tao
Kong, Xiangwei
Fu, Haiyan
Tian, Qi
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (12) : 4896 - 4907
[2] Discrete Robust Matrix Factorization Hashing for Large-Scale Cross-Media Retrieval
Yao, Tao
Li, Yiru
Guan, Weili
Wang, Gang
Li, Ying
Yan, Lianshan
Tian, Qi
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1391 - 1401
[3] Discrete Bidirectional Matrix Factorization Hashing for Zero-Shot Cross-Media Retrieval
Zhang, Donglin
Wu, Xiao-Jun
Yu, Jun
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 524 - 536
[4] Semantic convex matrix factorisation for cross-media retrieval
Fang, Yixian
Ren, Yuwei
Zhang, Huaxiang
IET IMAGE PROCESSING, 2019, 13 (01) : 196 - 205
[5] Towards Private and Scalable Cross-Media Retrieval
Hu, Shengshan
Zhang, Leo Yu
Wang, Qian
Qin, Zhan
Wang, Cong
IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (03) : 1354 - 1368
[6] Discrete Semantic Matrix Factorization Hashing for Cross-Modal Retrieval
Qin, Jianyang
Fei, Lunke
Teng, Shaohua
Zhang, Wei
Liu, Dongning
Zhao, Genping
Yuan, Haoliang
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1550 - 1557
[7] Learning semantic correlations for cross-media retrieval
Wu, Fei
Zhang, Hong
Zhuang, Yueting
2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 1465 - +
[8] Discovering Semantic Vocabularies for Cross-Media Retrieval
Habibian, Amirhossein
Mensink, Thomas
Snoek, Cees G. M.
ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 131 - 138
[9] SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal Retrieval
Li, Chuan-Xiang
Chen, Zhen-Duo
Zhang, Peng-Fei
Luo, Xin
Nie, Liqiang
Zhang, Wei
Xu, Xin-Shun
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1 - 9
[10] Online Collective Matrix Factorization Hashing for Large-Scale Cross-Media Retrieval
Wang, Di
Wang, Quan
An, Yaqiang
Gao, Xinbo
Tian, Yumin
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1409 - 1418
