Criss-Cross Attention Based Auto Encoder for Video Anomaly Event Detection

被引：1

作者：

Wang, Jiaqi ^{[1
]}

Zhang, Jie ^{[2
]}

Ji, Genlin ^{[2
]}

Sheng, Bo ^{[3
]}

机构：

[1] Nanjing Normal Univ, Sch Math Sci, Nanjing 210023, Peoples R China

[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China

[3] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA

来源：

INTELLIGENT AUTOMATION AND SOFT COMPUTING | 2022年 / 34卷 / 03期

基金：

美国国家科学基金会;

关键词：

Video anomaly detection; bi-directional long short-term memory; convolutional autoencoder; Criss-Cross attention module; MIXTURES; NETWORKS;

D O I：

10.32604/iasc.2022.029535

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The surveillance applications generate enormous video data and present challenges to video analysis for huge human labor cost. Reconstruction-based convolutional autoencoders have achieved great success in video anomaly detection for their ability of automatically detecting abnormal event. The approaches learn normal patterns only with the normal data in an unsupervised way due to the difficulty of collecting anomaly samples and obtaining anomaly annotations. But convolutional autoencoders have limitations in global feature extraction for the local receptive field of convolutional kernels. What is more, 2-dimensional convolution lacks the capability of capturing temporal information while videos change over time. In this paper, we propose a method established on Criss-Cross attention based AutoEncoder (CCAE) for capturing global visual features of sequential video frames. The method utilizes Criss-Cross attention based encoder to extract global appearance features. Another Criss-Cross attention module is embedded into bi-directional convolutional long short-term memory in hidden layer to explore global temporal features between frames. A decoder is executed to fuse global appearance and temporal features and reconstruct the frames. We perform extensive experiments on two public datasets UCSD Ped2 and CUHK Avenue. The experimental results demonstrate that CCAE achieves superior detection accuracy compared with other video anomaly detection approaches.

引用

页码：1629 / 1642

页数：14

共 50 条

[21] Appearance-Motion United Auto-Encoder Framework for Video Anomaly Detection
Liu, Yang
Liu, Jing
Lin, Jieyu
Zhao, Mengyang
Song, Liang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (05) : 2498 - 2502
[22] Criss-Cross Attention Based Multi-level Fusion Network for Gastric Intestinal Metaplasia Segmentation
Nien, Chu-Min
Yang, Er-Hsiang
Chang, Wei-Lun
Cheng, Hsiu-Chi
Huang, Chun-Rong
IMAGING SYSTEMS FOR GI ENDOSCOPY, AND GRAPHS IN BIOMEDICAL IMAGE ANALYSIS, ISGIE 2022, 2022, 13754 : 13 - 23
[23] Anomaly-based Intrusion Detection Using Auto-encoder
Nguimbous, Yves Nsoga
Ksantini, Riadh
Bouhoula, Adel
2019 27TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM), 2019, : 505 - 509
[24] Anomaly detection method based on convolutional variational auto-encoder
Yu X.
Xu M.
Wang Y.
Wang S.
Hu N.
Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2021, 42 (05): : 151 - 158
[25] Patch distance based auto-encoder for industrial anomaly detection
Ma, Zeqi
Li, Jiaxing
Wong, Wai Keung
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 270
[26] CCA-Net: A Lightweight Network Using Criss-Cross Attention for CSI Feedback
Wang, Binghui
Teng, Yinglei
Lau, Vincent
Han, Zhu
IEEE COMMUNICATIONS LETTERS, 2023, 27 (07) : 1879 - 1883
[27] Video Event Restoration Based on Keyframes for Video Anomaly Detection
Yang, Zhiwei
Liu, Jing
Wu, Zhaoyang
Wu, Peng
Liu, Xiaotao
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14592 - 14601
[28] Cross -modal guidance based auto -encoder for multi -video summarization
Ji, Zhong
Zhao, Yuxiao
Pang, Yanwei
Li, Xuelong
PATTERN RECOGNITION LETTERS, 2020, 135 (135) : 131 - 137
[29] ResCCFusion: Infrared and visible image fusion network based on ResCC module and spatial criss-cross attention models
Xiong, Zhang
Zhang, Xiaohui
Han, Hongwei
Hu, Qingping
INFRARED PHYSICS & TECHNOLOGY, 2024, 136
[30] FS2CCTrans: Frequency-Spatial-Spectral Joint Analysis With Criss-Cross Transformer for Hyperspectral Anomaly Detection
Zhang, Guorong
Sun, Tao
Lu, Fangxiao
Zhang, Shaoquan
Yin, Jie
Wu, Yuhao
Xiong, Zhengqiang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63

← 1 2 3 4 5 →