Temporal capsule networks for video motion estimation and error concealment

Cited by: 8
Authors
Sankisa, Arun [1]
Punjabi, Arjun [1]
Katsaggelos, Aggelos K. [1]
Affiliations
[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA
Keywords
Capsule networks; Conv3D; ConvLSTM; Error concealment; Motion estimation
DOI
10.1007/s11760-020-01671-x
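Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract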
In this paper, we present a temporal capsule network architecture that encodes motion in videos as an instantiation parameter. The extracted motion is used to perform motion-compensated error concealment. We modify the original capsule network architecture and use a carefully curated dataset to enable the training of capsules both spatially and temporally. First, we add the temporal dimension by taking co-located "patches" from three consecutive frames obtained from standard video sequences to form input data "cubes." Second, the network is designed with an initial feature extraction layer that operates on all three dimensions to generate spatiotemporal features. Third, we implement the PrimaryCaps module with a recurrent layer, instead of a conventional convolutional layer, to extract short-term motion-related temporal dependencies and encode them as activation vectors in the capsule output. Finally, the capsule output is combined with the most recent past frame and passed through a fully connected reconstruction network to perform motion-compensated error concealment. We study the effectiveness of temporal capsules by comparing the proposed model with architectures that do not include capsules. Although the quality of the reconstruction shows room for improvement, we demonstrate that capsule-based architectures can be designed to operate in the temporal dimension and encode motion-related attributes as instantiation parameters. The accuracy of motion estimation is evaluated by comparing both the reconstructed frame outputs and the corresponding optical flow estimates with ground truth data.
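The input construction described in the abstract (co-located patches from three consecutive frames stacked into "cubes") can be sketched roughly as follows. This is only an illustrative NumPy sketch, not the authors' code: the function names `extract_cubes` and `squash`, the 16x16 patch size, and the non-overlapping stride are assumptions, since the abstract gives none of these. The `squash` function is the vector nonlinearity from the original capsule network formulation; whether the paper uses it unchanged is likewise an assumption.

```python
import numpy as np


def extract_cubes(frames, patch=16, stride=16):
    """Stack co-located patches from 3 consecutive frames into cubes.

    frames: array of shape (T, H, W), e.g. grayscale video frames.
    patch, stride: hypothetical values, not taken from the paper.
    Returns an array of shape (N, 3, patch, patch).
    """
    frames = np.asarray(frames)
    num_frames, height, width = frames.shape
    cubes = []
    for t in range(num_frames - 2):  # every window of 3 consecutive frames
        for y in range(0, height - patch + 1, stride):
            for x in range(0, width - patch + 1, stride):
                # Same (y, x) location in frames t, t+1, t+2
                cubes.append(frames[t:t + 3, y:y + patch, x:x + patch])
    if not cubes:
        return np.empty((0, 3, patch, patch), dtype=frames.dtype)
    return np.stack(cubes)


def squash(s, axis=-1, eps=1e-8):
    """Capsule squash nonlinearity: keeps direction, maps length into [0, 1)."""
    sq_norm = np.sum(s ** 2, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)


if __name__ == "__main__":
    video = np.random.rand(5, 32, 32)   # 5 synthetic frames of 32x32 pixels
    cubes = extract_cubes(video)
    print(cubes.shape)                  # (windows * patches, 3, 16, 16)
    print(np.linalg.norm(squash(np.array([[3.0, 4.0]]))))
```

In a full model along the lines the abstract describes, each cube would feed a 3D convolutional feature extractor and a recurrent PrimaryCaps layer; those layers are omitted here because the abstract does not specify their configuration.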
Pages: 1369-1377
Number of pages: 9