Criss-Cross Attention Based Auto Encoder for Video Anomaly Event Detection

被引:1
|
作者
Wang, Jiaqi [1 ]
Zhang, Jie [2 ]
Ji, Genlin [2 ]
Sheng, Bo [3 ]
机构
[1] Nanjing Normal Univ, Sch Math Sci, Nanjing 210023, Peoples R China
[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China
[3] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA
来源
基金
美国国家科学基金会;
关键词
Video anomaly detection; bi-directional long short-term memory; convolutional autoencoder; Criss-Cross attention module; MIXTURES; NETWORKS;
D O I
10.32604/iasc.2022.029535
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The surveillance applications generate enormous video data and present challenges to video analysis for huge human labor cost. Reconstruction-based convolutional autoencoders have achieved great success in video anomaly detection for their ability of automatically detecting abnormal event. The approaches learn normal patterns only with the normal data in an unsupervised way due to the difficulty of collecting anomaly samples and obtaining anomaly annotations. But convolutional autoencoders have limitations in global feature extraction for the local receptive field of convolutional kernels. What is more, 2-dimensional convolution lacks the capability of capturing temporal information while videos change over time. In this paper, we propose a method established on Criss-Cross attention based AutoEncoder (CCAE) for capturing global visual features of sequential video frames. The method utilizes Criss-Cross attention based encoder to extract global appearance features. Another Criss-Cross attention module is embedded into bi-directional convolutional long short-term memory in hidden layer to explore global temporal features between frames. A decoder is executed to fuse global appearance and temporal features and reconstruct the frames. We perform extensive experiments on two public datasets UCSD Ped2 and CUHK Avenue. The experimental results demonstrate that CCAE achieves superior detection accuracy compared with other video anomaly detection approaches.
引用
收藏
页码:1629 / 1642
页数:14
相关论文
共 50 条
  • [21] Appearance-Motion United Auto-Encoder Framework for Video Anomaly Detection
    Liu, Yang
    Liu, Jing
    Lin, Jieyu
    Zhao, Mengyang
    Song, Liang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (05) : 2498 - 2502
  • [22] Criss-Cross Attention Based Multi-level Fusion Network for Gastric Intestinal Metaplasia Segmentation
    Nien, Chu-Min
    Yang, Er-Hsiang
    Chang, Wei-Lun
    Cheng, Hsiu-Chi
    Huang, Chun-Rong
    IMAGING SYSTEMS FOR GI ENDOSCOPY, AND GRAPHS IN BIOMEDICAL IMAGE ANALYSIS, ISGIE 2022, 2022, 13754 : 13 - 23
  • [23] Anomaly-based Intrusion Detection Using Auto-encoder
    Nguimbous, Yves Nsoga
    Ksantini, Riadh
    Bouhoula, Adel
    2019 27TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM), 2019, : 505 - 509
  • [24] Anomaly detection method based on convolutional variational auto-encoder
    Yu X.
    Xu M.
    Wang Y.
    Wang S.
    Hu N.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2021, 42 (05): : 151 - 158
  • [25] Patch distance based auto-encoder for industrial anomaly detection
    Ma, Zeqi
    Li, Jiaxing
    Wong, Wai Keung
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 270
  • [26] CCA-Net: A Lightweight Network Using Criss-Cross Attention for CSI Feedback
    Wang, Binghui
    Teng, Yinglei
    Lau, Vincent
    Han, Zhu
    IEEE COMMUNICATIONS LETTERS, 2023, 27 (07) : 1879 - 1883
  • [27] Video Event Restoration Based on Keyframes for Video Anomaly Detection
    Yang, Zhiwei
    Liu, Jing
    Wu, Zhaoyang
    Wu, Peng
    Liu, Xiaotao
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14592 - 14601
  • [28] Cross -modal guidance based auto -encoder for multi -video summarization
    Ji, Zhong
    Zhao, Yuxiao
    Pang, Yanwei
    Li, Xuelong
    PATTERN RECOGNITION LETTERS, 2020, 135 (135) : 131 - 137
  • [29] ResCCFusion: Infrared and visible image fusion network based on ResCC module and spatial criss-cross attention models
    Xiong, Zhang
    Zhang, Xiaohui
    Han, Hongwei
    Hu, Qingping
    INFRARED PHYSICS & TECHNOLOGY, 2024, 136
  • [30] FS2CCTrans: Frequency-Spatial-Spectral Joint Analysis With Criss-Cross Transformer for Hyperspectral Anomaly Detection
    Zhang, Guorong
    Sun, Tao
    Lu, Fangxiao
    Zhang, Shaoquan
    Yin, Jie
    Wu, Yuhao
    Xiong, Zhengqiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63