Deep Learning in Latent Space for Video Prediction and Compression

被引:45
|
作者
Liu, Bowen [1 ]
Chen, Yu [1 ]
Liu, Shiyu [1 ]
Kim, Hun-Seok [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
EVENT DETECTION;
D O I
10.1109/CVPR46437.2021.00076
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning-based video compression has achieved substantial progress during recent years. The most influential approaches adopt deep neural networks (DNNs) to remove spatial and temporal redundancies by finding the appropriate lower-dimensional representations of frames in the video. We propose a novel DNN based framework that predicts and compresses video sequences in the latent vector space. The proposed method first learns the efficient lower-dimensional latent space representation of each video frame and then performs inter-frame prediction in that latent domain. The proposed latent domain compression of individual frames is obtained by a deep autoencoder trained with a generative adversarial network (GAN). To exploit the temporal correlation within the video frame sequence, we employ a convolutional long short-term memory (ConvLSTM) network to predict the latent vector representation of the future frame. We demonstrate our method with two applications; video compression and abnormal event detection that share the identical latent frame prediction network. The proposed method exhibits superior or competitive performance compared to the state-of-the-art algorithms specifically designed for either video compression or anomaly detection.(1)
引用
收藏
页码:701 / 710
页数:10
相关论文
共 50 条
  • [31] Deep learning-guided video compression for machine vision tasks
    Kim, Aro
    Woo, Seung-taek
    Park, Minho
    Kim, Dong-hwi
    Lim, Hanshin
    Jung, Soon-heung
    Kwak, Sangwoon
    Park, Sang-hyo
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2024, 2024 (01)
  • [32] Deep Learning Based Video Compression Techniques with Future Research Issues
    Joy, Helen K. K.
    Kounte, Manjunath R. R.
    Chandrasekhar, Arunkumar
    Paul, Manoranjan
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 131 (04) : 2599 - 2625
  • [33] Guest Editorial: Special Issue on Deep Learning for Video Analysis and Compression
    Xu, Dong
    Chellappa, Rama
    Van Gool, Luc
    Lu, Guo
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (12) : 3171 - 3173
  • [34] Deep Learning Based Video Compression Techniques with Future Research Issues
    Helen K. Joy
    Manjunath R. Kounte
    Arunkumar Chandrasekhar
    Manoranjan Paul
    Wireless Personal Communications, 2023, 131 : 2599 - 2625
  • [35] Innovative Insights: A Review of Deep Learning Methods for Enhanced Video Compression
    Khadir, Mohammad
    Farukh Hashmi, Mohammad
    Kotambkar, Deepali M.
    Gupta, Aditya
    IEEE ACCESS, 2024, 12 : 125706 - 125725
  • [36] Editorial: Introduction to the Issue on Deep Learning for Image/Video Restoration and Compression
    Tekalp, A. Murat
    Covell, Michele
    Timofte, Radu
    Dong, Chao
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) : 157 - 161
  • [37] Deep learning-based Feature compression for Video Coding for Machine
    Do, Jihoon
    Lee, Jooyoung
    Kim, Younhee
    Jeong, Se Yoon
    Choi, Jin Soo
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2022, 2022, 12177
  • [38] Learning-Based Video Coding with Joint Deep Compression and Enhancement
    Zhao, Tiesong
    Feng, Weize
    Zeng, Hongji
    Xu, Yiwen
    Niu, Yuzhen
    Liu, Jiaying
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3045 - 3054
  • [39] Deep Generative Video Compression
    Han, Jun
    Lombardo, Salvator
    Schroers, Christopher
    Mandt, Stephan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [40] Deep Hierarchical Video Compression
    Lu, Ming
    Duan, Zhihao
    Zhu, Fengqing
    Ma, Zhan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024, : 8859 - 8867