Scalable Discrete Matrix Factorization and Semantic Autoencoder for Cross-Media Retrieval

被引:27
|
作者
Zhang, Donglin [1 ,2 ]
Wu, Xiao-Jun [1 ,2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Autoencoder; cross-modal retrieval; hashing; NEAREST-NEIGHBOR; BINARY-CODES; IMAGE SEARCH; QUANTIZATION;
D O I
10.1109/TCYB.2020.3032017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hashing methods have sparked great attention on multimedia tasks due to their effectiveness and efficiency. However, most existing methods generate binary codes by relaxing the binary constraints, which may cause large quantization error. In addition, most supervised cross-modal approaches preserve the similarity relationship by constructing an n x n large-size similarity matrix, which requires huge computation, making these methods unscalable. To address the above challenges, this article presents a novel algorithm, called scalable discrete matrix factorization and semantic autoencoder method (SDMSA). SDMSA is a two-stage method. In the first stage, the matrix factorization scheme is utilized to learn the latent semantic information, the label matrix is incorporated into the loss function instead of the similarity matrix. Thereafter, the binary codes can be generated by the latent representations. During optimization, we can avoid manipulating a large nxn similarity matrix, and the hash codes can be generated directly. In the second stage, a novel hash function learning scheme based on the autoencoder is proposed. The encoder-decoder paradigm aims to learn projections, the feature vectors are projected to code vectors by encoder, and the code vectors are projected back to the original feature vectors by the decoder. The encoder-decoder scheme ensures the embedding can well preserve both the semantic and feature information. Specifically, two algorithms SDMSA-lin and SDMSA-ker are developed under the SDMSA framework. Owing to the merit of SDMSA, we can get more semantically meaningful binary hash codes. Extensive experiments on several databases show that SDMSA-lin and SDMSA-ker achieve promising performance.
引用
收藏
页码:5947 / 5960
页数:14
相关论文
共 50 条
  • [31] MARS: Learning Modality-Agnostic Representation for Scalable Cross-Media Retrieval
    Wang, Yunbo
    Peng, Yuxin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4765 - 4777
  • [32] Accumulated reconstruction error vector (AREV): a semantic representation for cross-media retrieval
    Liu, Kai
    Wei, Shikui
    Zhao, Yao
    Zhu, Zhenfeng
    Wei, Yunchao
    Xu, Changsheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (02) : 561 - 576
  • [33] Accumulated reconstruction error vector (AREV): a semantic representation for cross-media retrieval
    Kai Liu
    Shikui Wei
    Yao Zhao
    Zhenfeng Zhu
    Yunchao Wei
    Changsheng Xu
    Multimedia Tools and Applications, 2015, 74 : 561 - 576
  • [34] Research on Cross-media Retrieval Based on Semantic Association for Hospital Information System
    Yang, Yanchun
    Sun, Hongfeng
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 17 - 17
  • [35] Semantic Constraints Matrix Factorization Hashing for cross-modal retrieval
    Li, Weian
    Xiong, Haixia
    Ou, Weihua
    Gou, Jianping
    Deng, Jiaxing
    Liang, Linqing
    Zhou, Quan
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
  • [36] Discrete semantic embedding hashing for scalable cross-modal retrieval
    Liu, Junjie
    Fei, Lunke
    Jia, Wei
    Zhao, Shuping
    Wen, Jie
    Teng, Shaohua
    Zhang, Wei
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 1461 - 1467
  • [37] Supervised Discrete Matrix Factorization Hashing For Cross-Modal Retrieval
    Wu, Fei
    Wu, Zhiyong
    Feng, Yujian
    Zhou, Jun
    Huang, He
    Li, Xinwei
    Dong, Xiwei
    Jing, Xiao Yuan
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 855 - 859
  • [38] Robust and discrete matrix factorization hashing for cross-modal retrieval
    Zhang, Donglin
    Wu, Xiao-Jun
    PATTERN RECOGNITION, 2022, 122
  • [39] Cross-Media Retrieval via Deep Semantic Canonical Correlation Analysis and Logistic Regression
    Zhang, Hong
    Xia, Liangmeng
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 123 - 133
  • [40] Research on Cross-media Semantic Retrieval Methods of Information Resources Based on Deep Learning
    Zhu, Yu
    14TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND EDUCATION (ICCSE 2019), 2019, : 340 - 344