Scalable Discrete Matrix Factorization and Semantic Autoencoder for Cross-Media Retrieval

Cited by: 27
Authors
Zhang, Donglin [1, 2]
Wu, Xiao-Jun [1, 2]
Affiliations
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Autoencoder; cross-modal retrieval; hashing; NEAREST-NEIGHBOR; BINARY-CODES; IMAGE SEARCH; QUANTIZATION;
DOI
10.1109/TCYB.2020.3032017
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Hashing methods have attracted great attention in multimedia tasks due to their effectiveness and efficiency. However, most existing methods generate binary codes by relaxing the binary constraints, which may cause large quantization error. In addition, most supervised cross-modal approaches preserve the similarity relationship by constructing a large n × n similarity matrix, which requires heavy computation and makes these methods unscalable. To address these challenges, this article presents a novel algorithm called the scalable discrete matrix factorization and semantic autoencoder method (SDMSA). SDMSA is a two-stage method. In the first stage, a matrix factorization scheme is used to learn the latent semantic information, and the label matrix, rather than the similarity matrix, is incorporated into the loss function. The binary codes are then generated from the latent representations. During optimization, this avoids manipulating a large n × n similarity matrix, and the hash codes are generated directly. In the second stage, a novel hash function learning scheme based on the autoencoder is proposed. The encoder-decoder paradigm learns projections: the encoder projects the feature vectors to code vectors, and the decoder projects the code vectors back to the original feature vectors. This scheme ensures that the embedding preserves both the semantic and the feature information. Specifically, two algorithms, SDMSA-lin and SDMSA-ker, are developed under the SDMSA framework. Owing to these merits, SDMSA yields more semantically meaningful binary hash codes. Extensive experiments on several databases show that SDMSA-lin and SDMSA-ker achieve promising performance.
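The encoder-decoder idea in the second stage can be illustrated with a small sketch. This is not the authors' formulation, which the abstract does not spell out. It assumes a linear autoencoder with tied weights (the encoder is W, the decoder is its transpose), a trade-off weight `lam`, and placeholder binary codes standing in for the output of the first stage. The function names `fit_tied_autoencoder` and `encode` are hypothetical. Under those assumptions, setting the gradient of the objective to zero gives a Sylvester equation, solved here with a Kronecker-product linear system that is only practical for small dimensions.

```python
import numpy as np


def fit_tied_autoencoder(X, B, lam=1.0):
    """Minimize ||X - W.T @ B||_F^2 + lam * ||W @ X - B||_F^2 over W.

    X: (d, n) feature matrix, B: (r, n) codes in {-1, +1}, W: (r, d).
    Zero gradient yields the Sylvester equation
        (B B^T) W + W (lam X X^T) = (1 + lam) B X^T.
    Tied weights and this exact objective are assumptions for illustration.
    """
    r, d = B.shape[0], X.shape[0]
    A = B @ B.T                    # (r, r)
    C = lam * (X @ X.T)            # (d, d)
    D = (1.0 + lam) * (B @ X.T)    # (r, d)
    # With column-major vec: vec(A W) = (I kron A) vec(W),
    # and vec(W C) = (C^T kron I) vec(W).
    K = np.kron(np.eye(d), A) + np.kron(C.T, np.eye(r))
    w = np.linalg.solve(K, D.flatten(order="F"))
    return w.reshape((r, d), order="F")


def encode(W, X):
    """Project features to binary codes by taking the sign of W @ X."""
    return np.where(W @ X >= 0, 1, -1)


rng = np.random.default_rng(0)
d, r, n = 8, 4, 200
X = rng.standard_normal((d, n))
# Placeholder codes: in a two-stage method these would come from stage one.
B = np.where(rng.standard_normal((r, d)) @ X >= 0, 1.0, -1.0)

W = fit_tied_autoencoder(X, B, lam=1.0)
# Residual of the Sylvester equation for lam = 1.0.
residual = (B @ B.T) @ W + W @ (X @ X.T) - 2.0 * (B @ X.T)
codes = encode(W, X)
agreement = float(np.mean(codes == B))  # fraction of bits reproduced
```

The decoder term ties W to reconstructing the features from the codes, while the encoder term ties it to reproducing the codes from the features, which mirrors the abstract's claim that the embedding keeps both kinds of information. For realistic dimensions a dedicated Sylvester solver would replace the Kronecker system.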
Pages: 5947-5960
Page count: 14
Related papers
50 in total
  • [1] Discrete Semantic Alignment Hashing for Cross-Media Retrieval
    Yao, Tao
    Kong, Xiangwei
    Fu, Haiyan
    Tian, Qi
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (12) : 4896 - 4907
  • [2] Discrete Robust Matrix Factorization Hashing for Large-Scale Cross-Media Retrieval
    Yao, Tao
    Li, Yiru
    Guan, Weili
    Wang, Gang
    Li, Ying
    Yan, Lianshan
    Tian, Qi
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1391 - 1401
  • [3] Discrete Bidirectional Matrix Factorization Hashing for Zero-Shot Cross-Media Retrieval
    Zhang, Donglin
    Wu, Xiao-Jun
    Yu, Jun
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 524 - 536
  • [4] Semantic convex matrix factorisation for cross-media retrieval
    Fang, Yixian
    Ren, Yuwei
    Zhang, Huaxiang
    IET IMAGE PROCESSING, 2019, 13 (01) : 196 - 205
  • [5] Towards Private and Scalable Cross-Media Retrieval
    Hu, Shengshan
    Zhang, Leo Yu
    Wang, Qian
    Qin, Zhan
    Wang, Cong
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (03) : 1354 - 1368
  • [6] Discrete Semantic Matrix Factorization Hashing for Cross-Modal Retrieval
    Qin, Jianyang
    Fei, Lunke
    Teng, Shaohua
    Zhang, Wei
    Liu, Dongning
    Zhao, Genping
    Yuan, Haoliang
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1550 - 1557
  • [7] Learning semantic correlations for cross-media retrieval
    Wu, Fei
    Zhang, Hong
    Zhuang, Yueting
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 1465 - +
  • [8] Discovering Semantic Vocabularies for Cross-Media Retrieval
    Habibian, Amirhossein
    Mensink, Thomas
    Snoek, Cees G. M.
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 131 - 138
  • [9] SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal Retrieval
    Li, Chuan-Xiang
    Chen, Zhen-Duo
    Zhang, Peng-Fei
    Luo, Xin
    Nie, Liqiang
    Zhang, Wei
    Xu, Xin-Shun
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1 - 9
  • [10] Online Collective Matrix Factorization Hashing for Large-Scale Cross-Media Retrieval
    Wang, Di
    Wang, Quan
    An, Yaqiang
    Gao, Xinbo
    Tian, Yumin
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1409 - 1418