Cross-modal image-text search via Efficient Discrete Class Alignment Hashing

Cited by: 15

Authors:

Wang, Song [1,2]

Zhao, Huan [1,2]

Wang, Yunbo [3]

Huang, Jing [1,2]

Li, Keqin [1,2,4]

Affiliations:

[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China

[2] Key Lab Embedded & Network Comp Hunan Prov, Changsha 410082, Peoples R China

[3] Peking Univ, Wangxuan Inst Comp Technol, Beijing 100080, Peoples R China

[4] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA

Source:

INFORMATION PROCESSING & MANAGEMENT | 2022, Vol. 59, No. 3

Funding:

National Natural Science Foundation of China

Keywords:

Class alignment; Cross-modal image-text search; Hash code; Supervised hashing; BINARY-CODES

DOI:

10.1016/j.ipm.2022.102886

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Hashing has shown great potential in cross-modal image-text search: it learns compact binary codes by exploring the correlations between distinct modalities. However, existing methods still have some limitations. First, most of them neglect the relation between the data characteristics and the supervised information. Second, relaxation strategies result in large quantization errors. Third, constructing large n × n similarity graphs (where n is the training size) increases the computational load. To address these issues, we propose a novel discrete supervised hashing method, termed Efficient Discrete Class Alignment Hashing (EDCAH), which integrates class alignment and matrix factorization for hash learning. Specifically, it exploits the semantic consistency of data instances and informative labels to simultaneously learn the hash codes and hash functions. Meanwhile, a discrete optimization strategy is developed to solve the EDCAH problem, which helps generate high-quality hash codes. Furthermore, to improve the learning efficiency of EDCAH, we propose a fast variant, dubbed EDCAH-t, that uses a two-step hashing strategy. Extensive experiments demonstrate the superiority of EDCAH and EDCAH-t in both search accuracy and learning efficiency.
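The abstract names three generic ingredients of hashing-based retrieval: binary codes compared across modalities, the quantization error introduced when a relaxed (real-valued) code is binarized, and the n × n size of a pairwise similarity graph. The following is a minimal illustrative sketch of those ideas only. It is not the EDCAH algorithm, and all function names and toy values are hypothetical.

```python
def sign_quantize(vec):
    """Binarize a relaxed (real-valued) code to {-1, +1} by taking signs."""
    return [1 if v >= 0 else -1 for v in vec]


def quantization_error(vec):
    """Squared distance between a relaxed code and its binarized version."""
    return sum((v - b) ** 2 for v, b in zip(vec, sign_quantize(vec)))


def hamming(a, b):
    """Number of positions in which two {-1, +1} codes differ."""
    return sum(1 for x, y in zip(a, b) if x != y)


def rank_by_hamming(query_code, database_codes):
    """Return database indices sorted by Hamming distance to the query."""
    return sorted(range(len(database_codes)),
                  key=lambda i: hamming(query_code, database_codes[i]))


def similarity_graph_entries(n):
    """Entry count of a dense pairwise similarity graph over n training items."""
    return n * n


# Hypothetical toy data: 8-bit codes for three text items and one image query.
text_codes = [
    [1, 1, -1, -1, 1, -1, 1, 1],
    [-1, -1, 1, 1, -1, 1, -1, -1],
    [1, 1, -1, 1, 1, -1, 1, -1],
]
relaxed_image_query = [0.9, 0.2, -0.7, -0.1, 0.8, -0.6, 0.4, 0.3]

image_code = sign_quantize(relaxed_image_query)
ranking = rank_by_hamming(image_code, text_codes)
```

In this toy setup the image query is binarized first and the text items are then ranked by Hamming distance. The entry count returned by `similarity_graph_entries` grows quadratically with the training size, which is the cost the abstract refers to.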


Pages: 17
