Self-Supervised Temporal Sensitive Hashing for Video Retrieval

被引:0
|
作者
Li, Qihua [1 ]
Tian, Xing [2 ]
Ng, Wing W. Y. [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangdong Prov Key Lab Computat Intelligence & Cyb, Guangzhou 510006, Guangdong, Peoples R China
[2] South China Normal Univ, Sch Artificial Intelligence, Guangzhou 510631, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Hash functions; Sensitivity; Perturbation methods; Long short term memory; Transformers; Training; Robustness; Self-supervise; video hashing; video retrieval; transformer; CLASSIFICATION; LSTM;
D O I
10.1109/TMM.2024.3385183
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised video hashing methods retrieve large-scale video data without labels by making full use of visual and temporal information in original videos. Existing methods are not robust enough to handle small temporal differences between similar videos, because of the ignoring of future unseen samples on temporal which leads to large generalization errors. At the same time, existing self-supervised methods cannot preserve pairwise similarity information between large-scale unlabeled data efficiently and effectively. Thus, a self-supervised temporal sensitive video hashing (TSVH) is proposed in the paper for video retrieval. The TSVH uses a transformer-based autoencoder network with temporal sensitivity regularization to achieve low sensitivity of local temporal perturbations and preserve information of global temporal sequence. The pairwise similarity between video samples is effectively preserved by applying a hashing-based affinity matrix in the method. Experiments on realistic datasets show that the TSVH outperforms several state-of-the-art methods and classic methods.
引用
收藏
页码:9021 / 9035
页数:15
相关论文
共 50 条
  • [31] Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning
    Luo, Dezhao
    Liu, Chang
    Zhou, Yu
    Yang, Dongbao
    Ma, Can
    Ye, Qixiang
    Wang, Weiping
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11701 - 11708
  • [32] Self-Supervised Cross-Video Temporal Learning for Unsupervised Video Domain Adaptation
    Choi, Jinwoo
    Huang, Jia-Bin
    Sharma, Gaurav
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3464 - 3470
  • [33] Contrastive Self-Supervised Hashing With Dual Pseudo Agreement
    Li, Yang
    Wang, Yapeng
    Miao, Zhuang
    Wang, Jiabao
    Zhang, Rui
    IEEE ACCESS, 2020, 8 : 165034 - 165043
  • [34] Deep Discrete Hashing with Self-supervised Pairwise Labels
    Song, Jingkuan
    He, Tao
    Fan, Hangbo
    Gao, Lianli
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT I, 2017, 10534 : 223 - 238
  • [35] Deep Self-Supervised Hashing With Fine-Grained Similarity Mining for Cross-Modal Retrieval
    Han, Lijun
    Wang, Renlin
    Chen, Chunlei
    Zhang, Huihui
    Zhang, Yujie
    Zhang, Wenfeng
    IEEE ACCESS, 2024, 12 : 31756 - 31770
  • [36] Self-supervised learning-based weight adaptive hashing for fast cross-modal retrieval
    Yifan Li
    Xuan Wang
    Shuhan Qi
    Chengkai Huang
    Zoe. L Jiang
    Qing Liao
    Jian Guan
    Jiajia Zhang
    Signal, Image and Video Processing, 2021, 15 : 673 - 680
  • [37] Self-supervised learning-based weight adaptive hashing for fast cross-modal retrieval
    Li, Yifan
    Wang, Xuan
    Qi, Shuhan
    Huang, Chengkai
    Jiang, Zoe L.
    Liao, Qing
    Guan, Jian
    Zhang, Jiajia
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (04) : 673 - 680
  • [38] Self-Supervised Video Representation Learning by Uncovering Spatio-Temporal Statistics
    Wang, Jiangliu
    Jiao, Jianbo
    Bao, Linchao
    He, Shengfeng
    Liu, Wei
    Liu, Yun-hui
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3791 - 3806
  • [39] Attentive spatial-temporal contrastive learning for self-supervised video representation
    Yang, Xingming
    Xiong, Sixuan
    Wu, Kewei
    Shan, Dongfeng
    Xie, Zhao
    IMAGE AND VISION COMPUTING, 2023, 137
  • [40] Contrastive Spatio-Temporal Pretext Learning for Self-Supervised Video Representation
    Zhang, Yujia
    Po, Lai-Man
    Xu, Xuyuan
    Liu, Mengyang
    Wang, Yexin
    Ou, Weifeng
    Zhao, Yuzhi
    Yu, Wing-Yin
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3380 - 3389