Hierarchical grid model for video prediction

被引:0
|
作者
Li, Qinyu [1 ,2 ]
Wu, Siyuan [1 ]
Wang, Hanli [1 ,3 ,4 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai, Peoples R China
[2] Lanzhou City Univ, Dept Comp Sci, Lanzhou, Peoples R China
[3] Tongji Univ, Minist Educ, Key Lab Embedded Syst & Serv Comp, Shanghai, Peoples R China
[4] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Video Prediction; Spatial Transformer Predictor; Convolutional Neural Network; Long Short-term Memory;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video prediction has recently drawn more attention for its application potential. However, it is challenging to model long-term prediction since it has to predict dense pixels along both spatial and temporal dimensions. Several recent approaches for long-term video prediction view pixel transforming as a global process among adjacent frames, while the actual position and motion of pixels in real videos are arranged in a hierarchical manner. Inspired by this, a novel hierarchical prediction model is proposed in this work to decompose complex and composite motions of real videos into simple ones based on their locations. This will reduce learning difficulty and fit various movements as well. In addition, high-resolution videos which are harder to model are also investigated, since there are larger ranges of movement and much more details to take care of. The proposed model builds upon a spatial transformer predictor to realize hierarchical structure to learn motions from videos. The experimental results on the benchmark real-world video dataset Human3.6M demonstrate the effectiveness of the proposed model as compared with other baseline approaches.
引用
收藏
页码:808 / 815
页数:8
相关论文
共 50 条
  • [1] Probabilistic congestion prediction in hierarchical quad-grid model
    Yan, JT
    Chen, YH
    Wu, CW
    2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 1350 - 1353
  • [2] A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction
    Chatterjee, Moitreya
    Ahuja, Narendra
    Cherian, Anoop
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9731 - 9741
  • [3] Adaptive Hierarchical Motion-Focused Model for Video Prediction
    Tang, Min
    Wang, Wenmin
    Chen, Xiongtao
    He, Yifeng
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 579 - 588
  • [4] Unsupervised Hierarchical Disentanglement for Video Prediction
    Motallebi, Mohammad Reza
    Westfechtel, Thomas
    Li, Yang
    Harada, Tatsuya
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 470 - 476
  • [5] A Power Grid Equipment Fault Prediction Model Based on Faster RCNN and Video Streaming
    Feng, Youjun
    Luo, Jinman
    Wen, Zhaocong
    Yao, Zirui
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT II, 2022, 13339 : 161 - 172
  • [6] Hierarchical semantic associative video model
    Cheng, Y
    Xu, D
    PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS & SIGNAL PROCESSING, PROCEEDINGS, VOLS 1 AND 2, 2003, : 1217 - 1220
  • [7] Hierarchical simulation model for MPEG video
    Feng, J
    Lo, KT
    THIRD IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, PROCEEDINGS, 1998, : 473 - 477
  • [8] Hierarchical traffic model for MPEG video
    City Univ of Hong Kong, Hong Kong, China
    Chin J Electron, 1 (78-85):
  • [9] Study of storage model in the video grid
    Zhao, Hong
    Yin, Lei
    Shen, Chao
    Journal of Beijing Institute of Technology (English Edition), 2010, 19 (SUPPL. 2): : 61 - 65
  • [10] Hierarchical wavelet video coding using warping prediction
    Wien, M
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 142 - 145