Hierarchical grid model for video prediction

被引:0
|
作者
Li, Qinyu [1 ,2 ]
Wu, Siyuan [1 ]
Wang, Hanli [1 ,3 ,4 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai, Peoples R China
[2] Lanzhou City Univ, Dept Comp Sci, Lanzhou, Peoples R China
[3] Tongji Univ, Minist Educ, Key Lab Embedded Syst & Serv Comp, Shanghai, Peoples R China
[4] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Video Prediction; Spatial Transformer Predictor; Convolutional Neural Network; Long Short-term Memory;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video prediction has recently drawn more attention for its application potential. However, it is challenging to model long-term prediction since it has to predict dense pixels along both spatial and temporal dimensions. Several recent approaches for long-term video prediction view pixel transforming as a global process among adjacent frames, while the actual position and motion of pixels in real videos are arranged in a hierarchical manner. Inspired by this, a novel hierarchical prediction model is proposed in this work to decompose complex and composite motions of real videos into simple ones based on their locations. This will reduce learning difficulty and fit various movements as well. In addition, high-resolution videos which are harder to model are also investigated, since there are larger ranges of movement and much more details to take care of. The proposed model builds upon a spatial transformer predictor to realize hierarchical structure to learn motions from videos. The experimental results on the benchmark real-world video dataset Human3.6M demonstrate the effectiveness of the proposed model as compared with other baseline approaches.
引用
收藏
页码:808 / 815
页数:8
相关论文
共 50 条
  • [21] Hierarchical spatiotemporal Feature Interaction Network for video saliency prediction
    Jin, Yingjie
    Zhou, Xiaofei
    Zhang, Zhenjie
    Fang, Hao
    Shi, Ran
    Xu, Xiaobin
    IMAGE AND VISION COMPUTING, 2025, 154
  • [22] Hierarchical resource model and access mechanism of grid workflow
    College of Computer Science and Technology, Huazhong Univ. of Sci. and Technol., Wuhan 430074, China
    Huazhong Ligong Daxue Xuebao, 2006, SUPPL. (37-40):
  • [23] A Hyper Topology Hierarchical Trust Model in Professional Grid
    Wang, Baoyi
    ICICSE: 2008 INTERNATIONAL CONFERENCE ON INTERNET COMPUTING IN SCIENCE AND ENGINEERING, PROCEEDINGS, 2008, : 507 - 512
  • [24] The Grid resource discovery method based on hierarchical model
    Yin, Yulan
    Cui, Huanqing
    Chen, Xin
    Information Technology Journal, 2007, 6 (07) : 1090 - 1094
  • [25] A model to predict the optimal performance of the Hierarchical Data Grid
    Zhang, Junwei
    Lee, Bu-Sung
    Tang, Xueyan
    Yeo, Chai-Kiat
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2010, 26 (01): : 1 - 11
  • [26] Reliable and efficient hierarchical organization model for computational grid
    Abdullah, Aref M.
    Ali, Hesham A.
    Haikal, Amira Y.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2017, 104 : 191 - 205
  • [27] A Video Quality Prediction Model for the Elderly
    Pal, Debajyoti
    Vanijja, Vajirasak
    WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 1121 - 1127
  • [28] A hierarchical access control model for video database systems
    Bertino, E
    Fan, JP
    Ferrari, E
    Hacid, MS
    Elmagarmid, AK
    Zhu, XQ
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2003, 21 (02) : 155 - 191
  • [29] Video Event Recognition with Deep Hierarchical Context Model
    Wang, Xiaoyang
    Ji, Qiang
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 4418 - 4427
  • [30] A Hierarchical Context Model for Event Recognition in Surveillance Video
    Wang, Xiaoyang
    Ji, Qiang
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2561 - 2568