Cross-modal hashing with missing labels

被引：8

作者：

Ni, Haomin ^{[1
,3
]}

Zhang, Jianjun ^{[2
]}

Kang, Peipei ^{[2
]}

Fang, Xiaozhao ^{[1
,4
]}

Sun, Weijun ^{[5
]}

Xie, Shengli ^{[1
,3
]}

Han, Na ^{[6
]}

机构：

[1] Guangdong Univ Technol, Sch Automat, 100 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China

[2] Guangdong Univ Technol, Sch Comp Sci & Technol, 100 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China

[3] GDUT, Guangdong Key Lab IoT Informat Technol, 100 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China

[4] GDUT, Key Lab Intelligent Detect & Internet Things Mfg, 100 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China

[5] GDUT, Guangdong HongKong Macao Joint Lab Smart Discrete, 100 Waihuan Xi Rd, Guangzhou 510006, Guangdong, Peoples R China

[6] Guangdong Polytech Normal Univ, Sch Comp Sci, 293 Zhonghshan Dadao, Guangzhou 510665, Guangdong, Peoples R China

来源：

NEURAL NETWORKS | 2023年 / 165卷

基金：

中国国家自然科学基金; 美国国家科学基金会;

关键词：

Cross-modal retrieval; Hashing method; Weak supervision; Missing labels; REPRESENTATION;

D O I：

10.1016/j.neunet.2023.05.035

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Hashing-based cross-modal retrieval methods have become increasingly popular due to their advan-tages in storage and speed. While current methods have demonstrated impressive results, there are still several issues that have not been addressed. Specifically, many of these approaches assume that labels are perfectly assigned, despite the fact that in real-world scenarios, labels are often incomplete or partially missing. There are two reasons for this, as manual labeling can be a complex and time-consuming task, and annotators may only be interested in certain objects. As such, cross-modal retrieval with missing labels is a significant challenge that requires further attention. Moreover, the similarity between labels is frequently ignored, which is important for exploring the high-level semantics of labels. To address these limitations, we propose a novel method called Cross-Modal Hashing with Missing Labels (CMHML). Our method consists of several key components. First, we introduce Reliable Label Learning to preserve reliable information from the observed labels. Next, to infer the uncertain part of the predicted labels, we decompose the predicted labels into latent representations of labels and samples. The representation of samples is extracted from different modalities, which assists in inferring missing labels. We also propose Label Correlation Preservation to enhance the similarity between latent representations of labels. Hash codes are then learned from the representation of samples through Global Approximation Learning. We also construct a similarity matrix according to predicted labels and embed it into hash codes learning to explore the value of labels. Finally, we train linear classifiers to map original samples to a low-dimensional Hamming space. To evaluate the efficacy of CMHML, we conduct extensive experiments on four publicly available datasets. Our method is compared to other state-of-the-art methods, and the results demonstrate that our model performs competitively even when most labels are missing.& COPY; 2023 Elsevier Ltd. All rights reserved.

引用

页码：60 / 76

页数：17

共 50 条

[21] Unsupervised Contrastive Cross-Modal Hashing
Hu, Peng
Zhu, Hongyuan
Lin, Jie
Peng, Dezhong
Zhao, Yin-Ping
Peng, Xi
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3877 - 3889
[22] Frustratingly Easy Cross-Modal Hashing
Ma, Dekui
Liang, Jian
Kong, Xiangwei
He, Ran
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, : 237 - 241
[23] Hashing for Cross-Modal Similarity Retrieval
Liu, Yao
Yuan, Yanhong
Huang, Qiaoli
Huang, Zhixing
2015 11TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2015, : 1 - 8
[24] Supervised Hierarchical Cross-Modal Hashing
Sun, Changchang
Song, Xuemeng
Feng, Fuli
Zhao, Wayne Xin
Zhang, Hao
Nie, Liqiang
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 725 - 734
[25] Online Discriminative Cross-Modal Hashing
Kang, Xiao
Liu, Xingbo
Zhang, Xuening
Nie, Xiushan
Yin, Yilong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5242 - 5254
[26] Cross-Modal Deep Variational Hashing
Liong, Venice Erin
Lu, Jiwen
Tan, Yap-Peng
Zhou, Jie
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4097 - 4105
[27] Completely Unsupervised Cross-Modal Hashing
Duan, Jiasheng
Zhang, Pengfei
Huang, Zi
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 178 - 194
[28] Asymmetric Deep Cross-modal Hashing
Gu, Jingzi
Zhang, JinChao
Lin, Zheng
Li, Bo
Wang, Weiping
Meng, Dan
COMPUTATIONAL SCIENCE - ICCS 2019, PT V, 2019, 11540 : 41 - 54
[29] Contrastive Incomplete Cross-Modal Hashing
Luo, Haoyang
Zhang, Zheng
Nie, Liqiang
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 5823 - 5834
[30] Mask Cross-Modal Hashing Networks
Lin, Qiubin
Cao, Wenming
He, Zhiquan
He, Zhihai
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 550 - 558

← 1 2 3 4 5 →