Scalable Discrete Matrix Factorization and Semantic Autoencoder for Cross-Media Retrieval

被引：27

作者：

Zhang, Donglin ^{[1
,2
]}

Wu, Xiao-Jun ^{[1
,2
]}

机构：

[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China

[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2022年 / 52卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Autoencoder; cross-modal retrieval; hashing; NEAREST-NEIGHBOR; BINARY-CODES; IMAGE SEARCH; QUANTIZATION;

D O I：

10.1109/TCYB.2020.3032017

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Hashing methods have sparked great attention on multimedia tasks due to their effectiveness and efficiency. However, most existing methods generate binary codes by relaxing the binary constraints, which may cause large quantization error. In addition, most supervised cross-modal approaches preserve the similarity relationship by constructing an n x n large-size similarity matrix, which requires huge computation, making these methods unscalable. To address the above challenges, this article presents a novel algorithm, called scalable discrete matrix factorization and semantic autoencoder method (SDMSA). SDMSA is a two-stage method. In the first stage, the matrix factorization scheme is utilized to learn the latent semantic information, the label matrix is incorporated into the loss function instead of the similarity matrix. Thereafter, the binary codes can be generated by the latent representations. During optimization, we can avoid manipulating a large nxn similarity matrix, and the hash codes can be generated directly. In the second stage, a novel hash function learning scheme based on the autoencoder is proposed. The encoder-decoder paradigm aims to learn projections, the feature vectors are projected to code vectors by encoder, and the code vectors are projected back to the original feature vectors by the decoder. The encoder-decoder scheme ensures the embedding can well preserve both the semantic and feature information. Specifically, two algorithms SDMSA-lin and SDMSA-ker are developed under the SDMSA framework. Owing to the merit of SDMSA, we can get more semantically meaningful binary hash codes. Extensive experiments on several databases show that SDMSA-lin and SDMSA-ker achieve promising performance.

引用

页码：5947 / 5960

页数：14

共 50 条

[21] Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval
Yan, Ting-Kun
Xu, Xin-Shun
Guo, Shanqing
Huang, Zi
Wang, Xiao-Lin
CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1271 - 1280
[22] Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval
Li, Chuan-Xiang
Yan, Ting-Kun
Luo, Xin
Nie, Liqiang
Xu, Xin-Shun
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (11) : 2863 - 2877
[23] Online semantic embedding correlation for discrete cross-media hashing
Yang, Fan
Hu, Haoyu
Ma, Fumin
Ding, Xiaojian
Zhang, Qiaoxi
Liu, Xinqi
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272
[24] DAH: Discrete Asymmetric Hashing for Efficient Cross-Media Retrieval
Zhang, Donglin
Wu, Xiao-Jun
Xu, Tianyang
Yin, He-Feng
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1365 - 1378
[25] Supervised Coarse-to-Fine Semantic Hashing for cross-media retrieval
Yao, Tao
Kong, Xiangwei
Fu, Haiyan
Tian, Qi
DIGITAL SIGNAL PROCESSING, 2017, 63 : 135 - 144
[26] Semi-Supervised Learning Based Semantic Cross-Media Retrieval
Zheng, Xiyuan
Zhu, Wei
Yu, Zhenmei
Zhang, Meijia
IEEE ACCESS, 2021, 9 : 75049 - 75057
[27] Mining semantic correlation of heterogeneous multimedia data for cross-media retrieval
Zhuang, Yue-Ting
Yang, Yi
Wu, Fei
IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (02) : 221 - 229
[28] Dictionary Learning based Supervised Discrete Hashing for Cross-Media Retrieval
Wu, Ye
Luo, Xin
Xu, Xin-Shun
Guo, Shanqing
Shi, Yuliang
ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 222 - 230
[29] Discrete matrix factorization hashing for cross-modal retrieval
Fang, Xiaozhao
Liu, Zhihu
Han, Na
Jiang, Lin
Teng, Shaohua
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (10) : 3023 - 3036
[30] Discrete matrix factorization hashing for cross-modal retrieval
Xiaozhao Fang
Zhihu Liu
Na Han
Lin Jiang
Shaohua Teng
International Journal of Machine Learning and Cybernetics, 2021, 12 : 3023 - 3036

← 1 2 3 4 5 →