Self-Supervised Temporal Sensitive Hashing for Video Retrieval

被引:0
|
作者
Li, Qihua [1 ]
Tian, Xing [2 ]
Ng, Wing W. Y. [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangdong Prov Key Lab Computat Intelligence & Cyb, Guangzhou 510006, Guangdong, Peoples R China
[2] South China Normal Univ, Sch Artificial Intelligence, Guangzhou 510631, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Hash functions; Sensitivity; Perturbation methods; Long short term memory; Transformers; Training; Robustness; Self-supervise; video hashing; video retrieval; transformer; CLASSIFICATION; LSTM;
D O I
10.1109/TMM.2024.3385183
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised video hashing methods retrieve large-scale video data without labels by making full use of visual and temporal information in original videos. Existing methods are not robust enough to handle small temporal differences between similar videos, because of the ignoring of future unseen samples on temporal which leads to large generalization errors. At the same time, existing self-supervised methods cannot preserve pairwise similarity information between large-scale unlabeled data efficiently and effectively. Thus, a self-supervised temporal sensitive video hashing (TSVH) is proposed in the paper for video retrieval. The TSVH uses a transformer-based autoencoder network with temporal sensitivity regularization to achieve low sensitivity of local temporal perturbations and preserve information of global temporal sequence. The pairwise similarity between video samples is effectively preserved by applying a hashing-based affinity matrix in the method. Experiments on realistic datasets show that the TSVH outperforms several state-of-the-art methods and classic methods.
引用
收藏
页码:9021 / 9035
页数:15
相关论文
共 50 条
  • [21] Self-Supervised Video Action Localization with Adversarial Temporal Transforms
    Gong, Guoqiang
    Zheng, Liangfeng
    Jiang, Wenhao
    Mu, Yadong
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 693 - 699
  • [22] Self-supervised Bernoulli Autoencoders for Semi-supervised Hashing
    Nanculef, Ricardo
    Mena, Francisco
    Macaluso, Antonio
    Lodi, Stefano
    Sartori, Claudio
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2021, 2021, 12702 : 258 - 268
  • [23] Self-supervised Video Transformer
    Ranasinghe, Kanchana
    Naseer, Muzammal
    Khan, Salman
    Khan, Fahad Shahbaz
    Ryoo, Michael S.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2864 - 2874
  • [24] Learning From Self-Supervised Features for Hashing-Based Remote Sensing Image Retrieval
    Tang, Jiayi
    Wang, Dali
    Tong, Xiaochong
    Qiu, Chunping
    Yang, Weiming
    Lei, Yi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [25] Graph Convolutional Network Semantic Enhancement Hashing for Self-supervised Cross-Modal Retrieval
    Hu, Jinyu
    Li, Mingyong
    Zhang, Jiayan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 410 - 422
  • [26] Self-supervised Label-Visual Correlation Hashing for Multi-label Image Retrieval
    Liu, Yu
    Xie, Yanzhao
    Song, Jingkuan
    Wei, Rukai
    Zhou, Ke
    WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 129 - 143
  • [27] Self-Supervised Cluster-Contrast Distillation Hashing Network for Cross-Modal Retrieval
    Sun, Haoxuan
    Cao, Yudong
    Liu, Guangyuan
    IEEE ACCESS, 2023, 11 : 96584 - 96593
  • [28] Temporal Scene Montage for Self-Supervised Video Scene Boundary Detection
    Tan, Jiawei
    Yang, Pingan
    Chen, Lu
    Wang, Hongxing
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [29] TCGL: Temporal Contrastive Graph for Self-Supervised Video Representation Learning
    Liu, Yang
    Wang, Keze
    Liu, Lingbo
    Lan, Haoyuan
    Lin, Liang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1978 - 1993
  • [30] Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction
    Teeti, Izzeddin
    Bhargav, Rongali Sai
    Singh, Vivek
    Bradley, Andrew
    Banerjee, Biplab
    Cuzzolin, Fabio
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3273 - 3283