Self-Supervised Temporal Sensitive Hashing for Video Retrieval

被引:0
|
作者
Li, Qihua [1 ]
Tian, Xing [2 ]
Ng, Wing W. Y. [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangdong Prov Key Lab Computat Intelligence & Cyb, Guangzhou 510006, Guangdong, Peoples R China
[2] South China Normal Univ, Sch Artificial Intelligence, Guangzhou 510631, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Hash functions; Sensitivity; Perturbation methods; Long short term memory; Transformers; Training; Robustness; Self-supervise; video hashing; video retrieval; transformer; CLASSIFICATION; LSTM;
D O I
10.1109/TMM.2024.3385183
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised video hashing methods retrieve large-scale video data without labels by making full use of visual and temporal information in original videos. Existing methods are not robust enough to handle small temporal differences between similar videos, because of the ignoring of future unseen samples on temporal which leads to large generalization errors. At the same time, existing self-supervised methods cannot preserve pairwise similarity information between large-scale unlabeled data efficiently and effectively. Thus, a self-supervised temporal sensitive video hashing (TSVH) is proposed in the paper for video retrieval. The TSVH uses a transformer-based autoencoder network with temporal sensitivity regularization to achieve low sensitivity of local temporal perturbations and preserve information of global temporal sequence. The pairwise similarity between video samples is effectively preserved by applying a hashing-based affinity matrix in the method. Experiments on realistic datasets show that the TSVH outperforms several state-of-the-art methods and classic methods.
引用
收藏
页码:9021 / 9035
页数:15
相关论文
共 50 条
  • [1] Self-Supervised Locality-Sensitive Deep Hashing for the Robust Retrieval of Degraded Images
    Xiang, Lingyun
    Hu, Hailang
    Li, Qian
    Yu, Hao
    Shen, Xiaobo
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 1582 - 1596
  • [2] Contrastive Masked Autoencoders for Self-Supervised Video Hashing
    Wang, Yuting
    Wang, Jinpeng
    Chen, Bin
    Zeng, Ziyun
    Xia, Shu-Tao
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2733 - 2741
  • [3] Self-supervised Video Hashing via Bidirectional Transformers
    Li, Shuyan
    Li, Xiu
    Lu, Jiwen
    Zhou, Jie
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13544 - 13553
  • [4] Relational Consistency Induced Self-Supervised Hashing for Image Retrieval
    Jin, Lu
    Li, Zechao
    Pan, Yonghua
    Tang, Jinhui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1482 - 1494
  • [5] Self-supervised incomplete cross-modal hashing retrieval
    Peng, Shouyong
    Yao, Tao
    Li, Ying
    Wang, Gang
    Wang, Lili
    Yan, Zhiming
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 262
  • [6] Relational Consistency Induced Self-Supervised Hashing for Image Retrieval
    Jin, Lu
    Li, Zechao
    Pan, Yonghua
    Tang, Jinhui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1482 - 1494
  • [7] Sparse graph based self-supervised hashing for scalable image retrieval
    Wang, Weiwei
    Zhang, Haofeng
    Zhang, Zheng
    Liu, Li
    Shao, Ling
    INFORMATION SCIENCES, 2021, 547 : 622 - 640
  • [8] Deep Contrastive Self-Supervised Hashing for Remote Sensing Image Retrieval
    Tan, Xiaoyan
    Zou, Yun
    Guo, Ziyang
    Zhou, Ke
    Yuan, Qiangqiang
    REMOTE SENSING, 2022, 14 (15)
  • [9] Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval
    Li, Chao
    Deng, Cheng
    Li, Ning
    Liu, Wei
    Gao, Xinbo
    Tao, Dacheng
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4242 - 4251
  • [10] Self-Supervised Graph Convolution for Video Moment Retrieval
    Hu, Xiwen
    Wang, Guolong
    Shan, Shimin
    Liu, Yu
    Li, Jiangquan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PART X, 2023, 14263 : 407 - 419