Self-Supervised Temporal Sensitive Hashing for Video Retrieval

Authors
Li, Qihua [1]
Tian, Xing [2]
Ng, Wing W. Y. [1]
Affiliations
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangdong Prov Key Lab Computat Intelligence & Cyb, Guangzhou 510006, Guangdong, Peoples R China
[2] South China Normal Univ, Sch Artificial Intelligence, Guangzhou 510631, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Hash functions; Sensitivity; Perturbation methods; Long short term memory; Transformers; Training; Robustness; Self-supervise; video hashing; video retrieval; transformer; CLASSIFICATION; LSTM;
DOI
10.1109/TMM.2024.3385183
Chinese Library Classification
TP [Automation and computer technology];
Discipline classification code
0812;
Abstract
Self-supervised video hashing methods retrieve large-scale video data without labels by exploiting the visual and temporal information in the original videos. Existing methods are not robust to small temporal differences between similar videos: they ignore unseen future samples along the temporal dimension, which leads to large generalization errors. In addition, existing self-supervised methods cannot preserve pairwise similarity information among large-scale unlabeled data efficiently and effectively. This paper therefore proposes self-supervised temporal sensitive video hashing (TSVH) for video retrieval. TSVH uses a transformer-based autoencoder network with temporal sensitivity regularization to achieve low sensitivity to local temporal perturbations while preserving the information of the global temporal sequence. Pairwise similarity between video samples is preserved by applying a hashing-based affinity matrix. Experiments on realistic datasets show that TSVH outperforms several state-of-the-art and classic methods.
Pages: 9021-9035
Page count: 15
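
Illustrative sketch
The abstract names two ideas that a toy example can make concrete: the sensitivity of a hash code to local temporal perturbations, and an affinity matrix computed from hash codes. The Python sketch below is not the TSVH method. It assumes a hypothetical stand-in encoder (position-weighted pooling followed by random sign projections, in place of the paper's transformer autoencoder), models a local perturbation as a swap of two adjacent frames, and defines affinity as one minus the normalized Hamming distance. All function names are invented for illustration.

```python
import random


def make_projection(dim, n_bits, seed=0):
    """Random hyperplanes (assumption: stands in for a learned hash layer)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]


def encode(frames, projection):
    """Toy hash: position-weighted temporal pooling, then sign of projections."""
    n, dim = len(frames), len(frames[0])
    pooled = [0.0] * dim
    for t, frame in enumerate(frames):
        weight = (t + 1) / n  # later frames weigh more, so frame order matters
        for d in range(dim):
            pooled[d] += weight * frame[d]
    return [1 if sum(p * x for p, x in zip(row, pooled)) >= 0 else 0
            for row in projection]


def hamming(a, b):
    """Number of differing bits between two equal-length codes."""
    return sum(x != y for x, y in zip(a, b))


def swap_adjacent(frames, t):
    """Local temporal perturbation: swap frames t and t + 1 (returns a copy)."""
    out = list(frames)
    out[t], out[t + 1] = out[t + 1], out[t]
    return out


def temporal_sensitivity(frames, projection):
    """Mean Hamming distance between the original code and each perturbed code.

    Requires at least two frames.
    """
    base = encode(frames, projection)
    dists = [hamming(base, encode(swap_adjacent(frames, t), projection))
             for t in range(len(frames) - 1)]
    return sum(dists) / len(dists)


def affinity_matrix(codes):
    """Pairwise affinity in [0, 1]: 1 minus the normalized Hamming distance."""
    n_bits = len(codes[0])
    return [[1.0 - hamming(a, b) / n_bits for b in codes] for a in codes]


if __name__ == "__main__":
    proj = make_projection(dim=4, n_bits=8, seed=1)
    video = [[1.0, 0.0, 0.5, -1.0], [0.2, 0.9, -0.3, 0.4], [-0.7, 0.1, 0.8, 0.0]]
    print("code:", encode(video, proj))
    print("sensitivity:", temporal_sensitivity(video, proj))
```

In this toy setting, a lower sensitivity value means that small reorderings of neighbouring frames flip fewer hash bits, which is the kind of robustness the abstract describes; a regularizer in a real model would penalize this quantity during training rather than only measuring it.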