Self-Supervised Temporal Sensitive Hashing for Video Retrieval

被引：0

作者：

Li, Qihua ^{[1
]}

Tian, Xing ^{[2
]}

Ng, Wing W. Y. ^{[1
]}

机构：

[1] South China Univ Technol, Sch Comp Sci & Engn, Guangdong Prov Key Lab Computat Intelligence & Cyb, Guangzhou 510006, Guangdong, Peoples R China

[2] South China Normal Univ, Sch Artificial Intelligence, Guangzhou 510631, Guangdong, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

基金：

中国国家自然科学基金;

关键词：

Hash functions; Sensitivity; Perturbation methods; Long short term memory; Transformers; Training; Robustness; Self-supervise; video hashing; video retrieval; transformer; CLASSIFICATION; LSTM;

D O I：

10.1109/TMM.2024.3385183

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Self-supervised video hashing methods retrieve large-scale video data without labels by making full use of visual and temporal information in original videos. Existing methods are not robust enough to handle small temporal differences between similar videos, because of the ignoring of future unseen samples on temporal which leads to large generalization errors. At the same time, existing self-supervised methods cannot preserve pairwise similarity information between large-scale unlabeled data efficiently and effectively. Thus, a self-supervised temporal sensitive video hashing (TSVH) is proposed in the paper for video retrieval. The TSVH uses a transformer-based autoencoder network with temporal sensitivity regularization to achieve low sensitivity of local temporal perturbations and preserve information of global temporal sequence. The pairwise similarity between video samples is effectively preserved by applying a hashing-based affinity matrix in the method. Experiments on realistic datasets show that the TSVH outperforms several state-of-the-art methods and classic methods.

引用

页码：9021 / 9035

页数：15

共 50 条

[21] Self-Supervised Video Action Localization with Adversarial Temporal Transforms
Gong, Guoqiang
Zheng, Liangfeng
Jiang, Wenhao
Mu, Yadong
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 693 - 699
[22] Self-supervised Bernoulli Autoencoders for Semi-supervised Hashing
Nanculef, Ricardo
Mena, Francisco
Macaluso, Antonio
Lodi, Stefano
Sartori, Claudio
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2021, 2021, 12702 : 258 - 268
[23] Self-supervised Video Transformer
Ranasinghe, Kanchana
Naseer, Muzammal
Khan, Salman
Khan, Fahad Shahbaz
Ryoo, Michael S.
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2864 - 2874
[24] Learning From Self-Supervised Features for Hashing-Based Remote Sensing Image Retrieval
Tang, Jiayi
Wang, Dali
Tong, Xiaochong
Qiu, Chunping
Yang, Weiming
Lei, Yi
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[25] Graph Convolutional Network Semantic Enhancement Hashing for Self-supervised Cross-Modal Retrieval
Hu, Jinyu
Li, Mingyong
Zhang, Jiayan
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 410 - 422
[26] Self-supervised Label-Visual Correlation Hashing for Multi-label Image Retrieval
Liu, Yu
Xie, Yanzhao
Song, Jingkuan
Wei, Rukai
Zhou, Ke
WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 129 - 143
[27] Self-Supervised Cluster-Contrast Distillation Hashing Network for Cross-Modal Retrieval
Sun, Haoxuan
Cao, Yudong
Liu, Guangyuan
IEEE ACCESS, 2023, 11 : 96584 - 96593
[28] Temporal Scene Montage for Self-Supervised Video Scene Boundary Detection
Tan, Jiawei
Yang, Pingan
Chen, Lu
Wang, Hongxing
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
[29] TCGL: Temporal Contrastive Graph for Self-Supervised Video Representation Learning
Liu, Yang
Wang, Keze
Liu, Lingbo
Lan, Haoyuan
Lin, Liang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1978 - 1993
[30] Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction
Teeti, Izzeddin
Bhargav, Rongali Sai
Singh, Vivek
Bradley, Andrew
Banerjee, Biplab
Cuzzolin, Fabio
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3273 - 3283

← 1 2 3 4 5 →