Cross-modal image-text search via Efficient Discrete Class Alignment Hashing

被引：15

作者：

Wang, Song ^{[1
,2
]}

Zhao, Huan ^{[1
,2
]}

Wang, Yunbo ^{[3
]}

Huang, Jing ^{[1
,2
]}

Li, Keqin ^{[1
,2
,4
]}

机构：

[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China

[2] Key Lab Embedded & Network Comp Hunan Prov, Changsha 410082, Peoples R China

[3] Peking Univ, Wangxuan Inst Comp Technol, Beijing 100080, Peoples R China

[4] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA

来源：

INFORMATION PROCESSING & MANAGEMENT | 2022年 / 59卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Class alignment; Cross-modal image-text search; Hash code; Supervised hashing; BINARY-CODES;

D O I：

10.1016/j.ipm.2022.102886

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Hashing has produced enormous potentials in cross-modal image-text search, which learns compact binary codes by exploring the correlations between distinct modalities. However, there still exist some limitations. First, most existing methods neglect the relation between the data characteristics and supervised information. Second, a relaxation strategy results in large quantization errors. Third, constructing large n x n (a.k.a. training size) similarity graphs increases computational load. To address these issues, we propose a novel discrete supervised hashing method, termed Efficient Discrete Class Alignment Hashing (EDCAH), which integrates class alignment and matrix factorization for hashing learning. Specifically, it exploits the semantic consistency of data instances and informative labels to simultaneously learn the hash codes and hash functions. Meanwhile, a discrete optimization strategy is developed to solve the EDCAH, which is beneficial to generate high-quality hash codes. Furthermore, to improve the learning efficiency of EDCAH, we propose a fast and efficient variant dubbed EDCAH-t that utilizes a two-step hashing strategy. Extensive experiments demonstrate the superiority of EDCAH and EDCAH-t in both search accuracy and learning efficiency.

引用

页数：17

共 50 条

[1] Discrete Joint Semantic Alignment Hashing for Cross-Modal Image-Text Search
Wang, Song
Zhao, Huan
Li, Keqin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 8022 - 8036
[2] Cross-Modal Image-Text Matching via Coupled Projection Learning Hashing
Zhao, Huan
Wang, Haoqian
Zha, Xupeng
Wang, Song
2022 IEEE 9TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2022, : 367 - 376
[3] Adaptive Cross-Modal Embeddings for Image-Text Alignment
Wehrmann, Pinatas
Kolling, Camila
Barros, Rodrigo C.
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12313 - 12320
[4] Cross-modal alignment with graph reasoning for image-text retrieval
Zheng Cui
Yongli Hu
Yanfeng Sun
Junbin Gao
Baocai Yin
Multimedia Tools and Applications, 2022, 81 : 23615 - 23632
[5] Cross-modal alignment with graph reasoning for image-text retrieval
Cui, Zheng
Hu, Yongli
Sun, Yanfeng
Gao, Junbin
Yin, Baocai
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 23615 - 23632
[6] Hierarchical modal interaction balance cross-modal hashing for unsupervised image-text retrieval
Zhang J.
Lin Z.
Jiang X.
Li M.
Wang C.
Multimedia Tools and Applications, 2024, 83 (42) : 90487 - 90509
[7] DEEP RANK CROSS-MODAL HASHING WITH SEMANTIC CONSISTENT FOR IMAGE-TEXT RETRIEVAL
Liu, Xiaoqing
Zeng, Huanqiang
Shi, Yifan
Zhu, Jianqing
Ma, Kai-Kuang
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2022, 2022-May : 4828 - 4832
[8] Improving Image-Text Matching With Bidirectional Consistency of Cross-Modal Alignment
Li, Zhe
Zhang, Lei
Zhang, Kun
Zhang, Yongdong
Mao, Zhendong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 6590 - 6607
[9] DEEP RANK CROSS-MODAL HASHING WITH SEMANTIC CONSISTENT FOR IMAGE-TEXT RETRIEVAL
Liu, Xiaoqing
Zeng, Huanqiang
Shi, Yifan
Zhu, Jianqing
Ma, Kai-Kuang
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4828 - 4832
[10] Unsupervised deep hashing with multiple similarity preservation for cross-modal image-text retrieval
Xiong, Siyu
Pan, Lili
Ma, Xueqiang
Hu, Qinghua
Beckman, Eric
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (10) : 4423 - 4434

← 1 2 3 4 5 →