Criss-Cross Attention Based Auto Encoder for Video Anomaly Event Detection

Cited by: 1

Authors:

Wang, Jiaqi [1]

Zhang, Jie [2]

Ji, Genlin [2]

Sheng, Bo [3]

Affiliations:

[1] Nanjing Normal Univ, Sch Math Sci, Nanjing 210023, Peoples R China

[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China

[3] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA

Source:

INTELLIGENT AUTOMATION AND SOFT COMPUTING | 2022 / Volume 34 / Issue 03

Funding:

National Science Foundation (USA);

Keywords:

Video anomaly detection; bi-directional long short-term memory; convolutional autoencoder; Criss-Cross attention module; MIXTURES; NETWORKS;

DOI:

10.32604/iasc.2022.029535

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

Surveillance applications generate enormous volumes of video data, and analyzing that data manually incurs a huge labor cost. Reconstruction-based convolutional autoencoders have achieved great success in video anomaly detection because they can detect abnormal events automatically. Because anomaly samples are difficult to collect and annotate, these approaches learn normal patterns from normal data only, in an unsupervised way. However, convolutional autoencoders are limited in global feature extraction by the local receptive field of convolutional kernels. Moreover, 2-dimensional convolution cannot capture temporal information, even though videos change over time. In this paper, we propose a method built on a Criss-Cross attention based AutoEncoder (CCAE) for capturing global visual features of sequential video frames. The method uses a Criss-Cross attention based encoder to extract global appearance features. Another Criss-Cross attention module is embedded into a bi-directional convolutional long short-term memory in the hidden layer to explore global temporal features between frames. A decoder then fuses the global appearance and temporal features and reconstructs the frames. We perform extensive experiments on two public datasets, UCSD Ped2 and CUHK Avenue. The experimental results demonstrate that CCAE achieves superior detection accuracy compared with other video anomaly detection approaches.
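The criss-cross attention idea named in the abstract can be illustrated with a minimal sketch. In criss-cross attention as commonly described, each spatial position attends only to the positions in its own row and column rather than to the whole feature map. The code below is not the authors' implementation: the function name `criss_cross_attention` is hypothetical, and as a simplifying assumption the queries, keys, and values are the raw features (no learned projections, no residual connection).

```python
import math


def criss_cross_attention(feat):
    """Apply one criss-cross attention pass to an H x W grid of feature vectors.

    feat is a nested list: feat[i][j] is a list of C floats.
    Assumption for illustration: query = key = value = raw feature vector.
    """
    h, w = len(feat), len(feat[0])
    out = []
    for i in range(h):
        row_out = []
        for j in range(w):
            q = feat[i][j]
            # Criss-cross path: every position in row i plus every position
            # in column j, with the centre (i, j) counted once (H + W - 1 items).
            path = [(i, k) for k in range(w)]
            path += [(k, j) for k in range(h) if k != i]
            # Dot-product affinity between the query and each key on the path.
            scores = [sum(a * b for a, b in zip(q, feat[r][c])) for r, c in path]
            # Numerically stable softmax over the path.
            top = max(scores)
            exps = [math.exp(s - top) for s in scores]
            total = sum(exps)
            weights = [e / total for e in exps]
            # Weighted sum of the values on the path, channel by channel.
            attended = [
                sum(wt * feat[r][c][d] for wt, (r, c) in zip(weights, path))
                for d in range(len(q))
            ]
            row_out.append(attended)
        out.append(row_out)
    return out


if __name__ == "__main__":
    # Toy 2 x 2 feature map with one channel per position.
    toy = [[[0.0], [3.0]], [[6.0], [9.0]]]
    print(criss_cross_attention(toy))
```

A single pass gives each position context from only H + W - 1 positions, which is the source of the efficiency gain over full self-attention; a real module would add learned projections and would typically operate on batched tensors.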

Citation:

Pages: 1629-1642

Page count: 14
