Self-Supervised Temporal Sensitive Hashing for Video Retrieval

Cited by: 0

Authors:

Li, Qihua [1]

Tian, Xing [2]

Ng, Wing W. Y. [1]

Affiliations:

[1] South China Univ Technol, Sch Comp Sci & Engn, Guangdong Prov Key Lab Computat Intelligence & Cyb, Guangzhou 510006, Guangdong, Peoples R China

[2] South China Normal Univ, Sch Artificial Intelligence, Guangzhou 510631, Guangdong, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2024 / Vol. 26

Funding:

National Natural Science Foundation of China;

Keywords:

Hash functions; Sensitivity; Perturbation methods; Long short term memory; Transformers; Training; Robustness; Self-supervise; video hashing; video retrieval; transformer; CLASSIFICATION; LSTM;

DOI:

10.1109/TMM.2024.3385183

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Subject classification code:

0812;

Abstract:

Self-supervised video hashing methods retrieve large-scale video data without labels by making full use of the visual and temporal information in the original videos. Existing methods are not robust enough to handle small temporal differences between similar videos because they ignore future unseen samples along the temporal dimension, which leads to large generalization errors. At the same time, existing self-supervised methods cannot efficiently and effectively preserve pairwise similarity information among large-scale unlabeled data. Thus, a self-supervised temporal sensitive video hashing (TSVH) method is proposed in this paper for video retrieval. TSVH uses a transformer-based autoencoder network with temporal sensitivity regularization to achieve low sensitivity to local temporal perturbations while preserving the information of the global temporal sequence. The pairwise similarity between video samples is effectively preserved by applying a hashing-based affinity matrix. Experiments on realistic datasets show that TSVH outperforms several state-of-the-art methods and classic methods.

Pages: 9021-9035

Page count: 15
