An Efficient Cross-Modal Privacy-Preserving Image-Text Retrieval Scheme

Cited by: 1

Authors:

Zhang, Kejun [1,2]

Xu, Shaofei [1]

Song, Yutuo [2]

Xu, Yuwei [3]

Li, Pengcheng [2]

Yang, Xiang [1]

Zou, Bing [1]

Wang, Wenbin [1]

Affiliations:

[1] Beijing Elect Sci & Technol Inst, Beijing 100070, Peoples R China

[2] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Peoples R China

[3] Xian Univ Technol, Sch Automat & Informat Engn, Xian 710048, Peoples R China

Source:

SYMMETRY-BASEL | 2024 / Vol. 16 / Issue 08

Keywords:

privacy-preserving; searchable encryption; image-text retrieval; cross-modal retrieval; SEARCH;

DOI:

10.3390/sym16081084

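Chinese Library Classification (CLC) number:

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];

Discipline classification code: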

07 ; 0710 ; 09 ;

Abstract:

Preserving the privacy of the ever-increasing multimedia data on the cloud while providing accurate and fast retrieval services has become a hot topic in information security. However, existing relevant schemes still have significant room for improvement in accuracy and speed. Therefore, this paper proposes a privacy-preserving image-text retrieval scheme called PITR. To enhance model performance with minimal parameter training, we freeze all parameters of a multimodal pre-trained model and incorporate trainable modules along with either a general adapter or a specialized adapter, which are used to enhance the model's ability to perform zero-shot image classification and cross-modal retrieval in general or specialized datasets, respectively. To preserve the privacy of outsourced data on the cloud and the privacy of the user's retrieval process, we employ asymmetric scalar-product-preserving encryption technology suitable for inner product calculation, and we employ distributed index storage technology and construct a two-level security model. We construct a hierarchical index structure to speed up query matching among massive high-dimensional index vectors. Experimental results demonstrate that our scheme can provide users with secure, accurate, fast cross-modal retrieval service while preserving data privacy.
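The abstract names asymmetric scalar-product-preserving encryption (ASPE) as the tool for computing inner products over encrypted vectors. The following is a minimal sketch of the textbook ASPE idea only, not the PITR construction itself, whose details are not given in this record. It assumes a secret invertible matrix M, encrypts an index vector p as Mᵀp and a query vector q as M⁻¹q, and relies on (Mᵀp)·(M⁻¹q) = p·q. The matrix values, vector values, and function names are hypothetical, and practical schemes typically add randomization on top of this.

```python
from fractions import Fraction


def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(row) for row in zip(*m)]


def mat_vec(m, v):
    """Multiply matrix m by column vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def dot(u, v):
    """Inner product of two vectors."""
    return sum(a * b for a, b in zip(u, v))


def invert(m):
    """Gauss-Jordan inversion using exact fractions (raises if m is singular)."""
    n = len(m)
    aug = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(m)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [x / scale for x in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


# Hypothetical secret key: any invertible matrix works for this sketch.
SECRET_M = [[2, 1, 0], [0, 3, 1], [1, 0, 4]]
SECRET_M_INV = invert(SECRET_M)


def encrypt_index(p):
    """Encrypt an index (feature) vector as M^T p."""
    return mat_vec(transpose(SECRET_M), p)


def encrypt_query(q):
    """Encrypt a query vector as M^{-1} q."""
    return mat_vec(SECRET_M_INV, q)


# Illustrative vectors (assumed values, not data from the paper).
index_vec = [1, 2, 3]
query_vec = [4, 0, -1]

# The server only sees the encrypted vectors, yet recovers the same score.
enc_score = dot(encrypt_index(index_vec), encrypt_query(query_vec))
plain_score = dot(index_vec, query_vec)
print(enc_score == plain_score)
```

Exact `Fraction` arithmetic is used here so the equality holds without floating-point tolerance; a real system operating on high-dimensional embeddings would use floating-point linear algebra and compare scores approximately.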

Citations

Pages: 19

50 entries in total

[41] IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval
Chen, Hui
Ding, Guiguang
Liu, Xudong
Lin, Zijia
Liu, Ji
Han, Jungong
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 12652 - 12660
[42] A Deep Semantic Alignment Network for the Cross-Modal Image-Text Retrieval in Remote Sensing
Cheng, Qimin
Zhou, Yuzhuo
Fu, Peng
Xu, Yuan
Zhang, Liang
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 4284 - 4297
[43] Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval
Wang, Sijin
Wang, Ruiping
Yao, Ziwei
Shan, Shiguang
Chen, Xilin
2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1497 - 1506
[44] Strong and Weak Prompt Engineering for Remote Sensing Image-Text Cross-Modal Retrieval
Sun, Tianci
Zheng, Chengyu
Li, Xiu
Nie, Jie
Gao, Yanli
Huang, Lei
Wei, Zhiqiang
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 6968 - 6980
[45] Dual-branch networks for privacy-preserving cross-modal retrieval in cloud computing
Peng, Jianting
Xiang, Xuyu
Qin, Jiaohua
Tan, Yun
JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
[46] A TEXTURE AND SALIENCY ENHANCED IMAGE LEARNING METHOD FOR CROSS-MODAL REMOTE SENSING IMAGE-TEXT RETRIEVAL
Yang, Rui
Zhang, Di
Guo, YanHe
Wang, Shuang
IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 4895 - 4898
[47] Deep Cross-Modal Projection Learning for Image-Text Matching
Zhang, Ying
Lu, Huchuan
COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 : 707 - 723
[48] Review of unlabeled image-text cross-modal retrieval based on real-valued features
Zhang, Li
Chen, Kang
Sun, Guanghui
Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2024, 56 (09): 1 - 16
[49] MULTI-SCALE INTERACTIVE TRANSFORMER FOR REMOTE SENSING CROSS-MODAL IMAGE-TEXT RETRIEVAL
Wang, Yijing
Ma, Jingjing
Li, Mingteng
Tang, Xu
Han, Xiao
Jiao, Licheng
2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 839 - 842
[50] Cross-modal Semantically Augmented Network for Image-text Matching
Yao, Tao
Li, Yiru
Li, Ying
Zhu, Yingying
Wang, Gang
Yue, Jun
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)
