Criss-Cross Attention Based Auto Encoder for Video Anomaly Event Detection

被引:1
|
作者
Wang, Jiaqi [1 ]
Zhang, Jie [2 ]
Ji, Genlin [2 ]
Sheng, Bo [3 ]
机构
[1] Nanjing Normal Univ, Sch Math Sci, Nanjing 210023, Peoples R China
[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China
[3] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA
来源
基金
美国国家科学基金会;
关键词
Video anomaly detection; bi-directional long short-term memory; convolutional autoencoder; Criss-Cross attention module; MIXTURES; NETWORKS;
D O I
10.32604/iasc.2022.029535
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The surveillance applications generate enormous video data and present challenges to video analysis for huge human labor cost. Reconstruction-based convolutional autoencoders have achieved great success in video anomaly detection for their ability of automatically detecting abnormal event. The approaches learn normal patterns only with the normal data in an unsupervised way due to the difficulty of collecting anomaly samples and obtaining anomaly annotations. But convolutional autoencoders have limitations in global feature extraction for the local receptive field of convolutional kernels. What is more, 2-dimensional convolution lacks the capability of capturing temporal information while videos change over time. In this paper, we propose a method established on Criss-Cross attention based AutoEncoder (CCAE) for capturing global visual features of sequential video frames. The method utilizes Criss-Cross attention based encoder to extract global appearance features. Another Criss-Cross attention module is embedded into bi-directional convolutional long short-term memory in hidden layer to explore global temporal features between frames. A decoder is executed to fuse global appearance and temporal features and reconstruct the frames. We perform extensive experiments on two public datasets UCSD Ped2 and CUHK Avenue. The experimental results demonstrate that CCAE achieves superior detection accuracy compared with other video anomaly detection approaches.
引用
收藏
页码:1629 / 1642
页数:14
相关论文
共 50 条
  • [1] Dual Attention Mechanisms Based Auto-Encoder for Video Anomaly Detection
    Gu, Jiatao
    Zeng, Jing
    Ji, Genlin
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 153 - 165
  • [2] Attention-based misaligned spatiotemporal auto-encoder for video anomaly detection
    Yang, Haiyan
    Liu, Shuning
    Wu, Mingxuan
    Chen, Hongbin
    Zeng, Delu
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 285 - 297
  • [3] CCNet: Criss-Cross Attention for Semantic Segmentation
    Huang, Zilong
    Wang, Xinggang
    Huang, Lichao
    Huang, Chang
    Wei, Yunchao
    Liu, Wenyu
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 603 - 612
  • [4] CCNet: Criss-Cross Attention for Semantic Segmentation
    Huang, Zilong
    Wang, Xinggang
    Wei, Yunchao
    Huang, Lichao
    Shi, Humphrey
    Liu, Wenyu
    Huang, Thomas S.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 6896 - 6908
  • [5] Multi Chunk Learning Based Auto Encoder for Video Anomaly Detection
    Qi, Xiaosha
    Ji, Genlin
    Zhang, Jie
    Sheng, Bo
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 33 (03): : 1861 - 1875
  • [6] Criss-cross global interaction-based selective attention in YOLO for underwater object detection
    Shen, Xin
    Wang, Huibing
    Li, Yafeng
    Gao, Tianzhu
    Fu, Xianping
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 20003 - 20032
  • [7] Criss-cross global interaction-based selective attention in YOLO for underwater object detection
    Xin Shen
    Huibing Wang
    Yafeng Li
    Tianzhu Gao
    Xianping Fu
    Multimedia Tools and Applications, 2024, 83 : 20003 - 20032
  • [8] ATCC: Accurate tracking by criss-cross location attention
    Wu, Yong
    Liu, Zhi
    Zhou, Xiaofei
    Ye, Linwei
    Wang, Yang
    IMAGE AND VISION COMPUTING, 2021, 111
  • [9] Siamese visual tracking based on criss-cross attention and improved head network
    Zhang, Jianming
    Huang, Haitao
    Jin, Xiaokang
    Kuang, Li-Dan
    Zhang, Jin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 1589 - 1615
  • [10] Event log anomaly detection method based on auto-encoder and control flow
    Kan, Daoyu
    Fang, Xianwen
    MULTIMEDIA SYSTEMS, 2024, 30 (01)