Cross-modal hashing with missing labels

Cited by: 8
Authors
Ni, Haomin [1, 3]
Zhang, Jianjun [2 ]
Kang, Peipei [2 ]
Fang, Xiaozhao [1, 4]
Sun, Weijun [5 ]
Xie, Shengli [1, 3]
Han, Na [6 ]
Affiliations
[1] Guangdong Univ Technol, Sch Automat, 100 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China
[2] Guangdong Univ Technol, Sch Comp Sci & Technol, 100 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China
[3] GDUT, Guangdong Key Lab IoT Informat Technol, 100 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China
[4] GDUT, Key Lab Intelligent Detect & Internet Things Mfg, 100 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China
[5] GDUT, Guangdong HongKong Macao Joint Lab Smart Discrete, 100 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China
[6] Guangdong Polytech Normal Univ, Sch Comp Sci, 293 Zhongshan Dadao, Guangzhou 510665, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China; National Science Foundation (United States);
Keywords
Cross-modal retrieval; Hashing method; Weak supervision; Missing labels; REPRESENTATION;
DOI
10.1016/j.neunet.2023.05.035
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Hashing-based cross-modal retrieval methods have become increasingly popular due to their advantages in storage and speed. While current methods have demonstrated impressive results, several issues remain unaddressed. Specifically, many of these approaches assume that labels are perfectly assigned, although in real-world scenarios labels are often incomplete or partially missing. There are two reasons for this: manual labeling can be a complex and time-consuming task, and annotators may only be interested in certain objects. As such, cross-modal retrieval with missing labels is a significant challenge that requires further attention. Moreover, the similarity between labels, which is important for exploring the high-level semantics of labels, is frequently ignored. To address these limitations, we propose a novel method called Cross-Modal Hashing with Missing Labels (CMHML). Our method consists of several key components. First, we introduce Reliable Label Learning to preserve reliable information from the observed labels. Next, to infer the uncertain part of the predicted labels, we decompose the predicted labels into latent representations of labels and samples. The representation of samples is extracted from different modalities, which assists in inferring missing labels. We also propose Label Correlation Preservation to enhance the similarity between latent representations of labels. Hash codes are then learned from the representation of samples through Global Approximation Learning. We also construct a similarity matrix according to the predicted labels and embed it into hash code learning to explore the value of labels. Finally, we train linear classifiers to map original samples to a low-dimensional Hamming space. To evaluate the efficacy of CMHML, we conduct extensive experiments on four publicly available datasets. Our method is compared to other state-of-the-art methods, and the results demonstrate that our model performs competitively even when most labels are missing. © 2023 Elsevier Ltd. All rights reserved.
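The abstract mentions three generic building blocks: a similarity matrix built from (predicted) labels, linear maps from original samples into a Hamming space, and retrieval across modalities. The sketch below illustrates those generic ideas only and is not the CMHML algorithm. Cosine similarity, the sign-of-projection hashing, random projection matrices, and all function and variable names are assumptions introduced for illustration; the record does not specify any of them.

```python
import numpy as np


def label_similarity(labels):
    """Cosine similarity between rows of an (n_samples, n_labels) matrix.

    Assumption: cosine similarity is used here only as a common choice;
    the record does not say how the paper's similarity matrix is defined.
    """
    norms = np.linalg.norm(labels, axis=1, keepdims=True)
    normalized = labels / np.maximum(norms, 1e-12)  # guard all-zero rows
    return normalized @ normalized.T


def hash_codes(features, projection):
    """Map features to {-1, +1} codes: linear projection followed by sign."""
    return np.where(features @ projection >= 0, 1, -1)


def hamming_distance(query_codes, database_codes):
    """Pairwise Hamming distances between two sets of {-1, +1} codes.

    For +/-1 codes of length r, inner product = r - 2 * distance.
    """
    n_bits = query_codes.shape[1]
    return (n_bits - query_codes @ database_codes.T) // 2


# Hypothetical toy data: 5 image samples (8-dim) and 5 text samples (6-dim).
rng = np.random.default_rng(0)
image_features = rng.normal(size=(5, 8))
text_features = rng.normal(size=(5, 6))

# Assumption: random projections stand in for learned linear classifiers.
W_image = rng.normal(size=(8, 16))
W_text = rng.normal(size=(6, 16))

image_codes = hash_codes(image_features, W_image)
text_codes = hash_codes(text_features, W_text)

# Text-to-image retrieval: rank database images by Hamming distance.
distances = hamming_distance(text_codes[:1], image_codes)
ranking = np.argsort(distances[0], kind="stable")
```

Because both modalities are mapped to codes of the same length, a query from one modality can be compared directly with database items from the other, and the comparison reduces to integer arithmetic on short binary codes, which is the storage and speed advantage the abstract refers to.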
Pages: 60-76
Number of pages: 17
Related papers
50 in total
  • [1] Online hashing with partially known labels for cross-modal retrieval
    Shu, Zhenqiu
    Li, Li
    Yu, Zhengtao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [2] Deep Cross-Modal Hashing With Ranking Learning for Noisy Labels
    Shu, Zhenqiu
    Bai, Yibing
    Yong, Kailing
    Yu, Zhengtao
    IEEE TRANSACTIONS ON BIG DATA, 2025, 11 (02) : 553 - 565
  • [3] Exploiting Subspace Relation in Semantic Labels for Cross-Modal Hashing
    Shen, Heng Tao
    Liu, Luchen
    Yang, Yang
    Xu, Xing
    Huang, Zi
    Shen, Fumin
    Hong, Richang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (10) : 3351 - 3365
  • [4] Two-stage zero-shot sparse hashing with missing labels for cross-modal retrieval
    Yong, Kailing
    Shu, Zhenqiu
    Wang, Hongbin
    Yu, Zhengtao
    PATTERN RECOGNITION, 2024, 155
  • [5] Fast Discrete Cross-modal Hashing With Regressing From Semantic Labels
    Liu, Xingbo
    Nie, Xiushan
    Zeng, Wenjun
    Cui, Chaoran
    Zhu, Lei
    Yin, Yilong
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1662 - 1669
  • [6] A GENERAL FRAMEWORK FOR INCOMPLETE CROSS-MODAL RETRIEVAL WITH MISSING LABELS AND MISSING MODALITIES
    Li, Mingyang
    Huang, Shao-Lun
    Zhang, Lin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4763 - 4767
  • [7] Regularised Cross-Modal Hashing
    Moran, Sean
    Lavrenko, Victor
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 907 - 910
  • [8] Flexible Cross-Modal Hashing
    Yu, Guoxian
    Liu, Xuanwu
    Wang, Jun
    Domeniconi, Carlotta
    Zhang, Xiangliang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 304 - 314
  • [9] Extensible Cross-Modal Hashing
    Chen, Tian-yi
    Zhang, Lan
    Zhang, Shi-cong
    Li, Zi-long
    Huang, Bai-chuan
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2109 - 2115
  • [10] Discriminant Cross-modal Hashing
    Xu, Xing
    Shen, Fumin
    Yang, Yang
    Shen, Heng Tao
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 305 - 308