Efficient 3D Video Engine Using Frame Redundancy

被引:2
|
作者
Peng, Gao [1 ]
Pang, Bo [1 ]
Lu, Cewu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
10.1109/WACV48630.2021.00384
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional 3d video understanding methods process videos frame by frame. We argue that a lot of computation in this mechanism is redundant based on a key observation - adjacent frames in 3D videos have visually similar geometry structure. To handle the redundancy, we propose the Efficient 3D Video Engine (EVE), aiming to avoid the computation of redundant points. It consists of two modules: 1) redundancy removing module designed to detect redundancy and remove it; 2) residual learning module to extract features on non-redundant points. As a simple plug and play framework, EVE can be easily incorporated in mainstream 3D models. Experiments demonstrate that EVE can significantly reduce computation without performance loss on large scale datasets. On the other hand, with similar computation, EVE outperforms the strong baseline by up to 4.1 mIoU on SemanticKITTI. The code is available on https://github.com/ecr23xx/eve.
引用
收藏
页码:3791 / 3801
页数:11
相关论文
共 50 条
  • [31] Efficient representation of disoccluded regions in 3D video coding
    Farid, Muhammad Shahid
    Babar, Badi uz Zaman
    Khan, Muhammad Hassan
    ANNALS OF TELECOMMUNICATIONS, 2025, 80 (1-2) : 123 - 137
  • [32] Symmetrical Frame Discard Method for 3D video over IP Networks
    Chung, Young-uk
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (04) : 2790 - 2796
  • [33] Key frame extraction in 3D video by rate-distortion optimization
    Xu, Jianfeng
    Yamasaki, Toshihiko
    Aizawa, Kiyoharu
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1 - +
  • [34] Efficient 3D wavelet transform decomposition for video compression
    Moyano, E
    Quiles, FJ
    Garrido, A
    Orozco-Barbosa, L
    Duato, J
    SECOND INTERNATIONAL WORKSHOP ON DIGITAL AND COMPUTATIONAL VIDEO, PROCEEDINGS, 2001, : 118 - 125
  • [35] Towards key-frame extraction methods for 3D video: a review
    Ferreira, Lino
    da Silva Cruz, Luis A.
    Assuncao, Pedro
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2016,
  • [36] FRAME RATE UP CONVERSION OF 3D VIDEO BY MOTION AND DEPTH FUSION
    Lee, Yeejin
    Lee, Zucheul
    Nguyen, Truong Q.
    2013 IEEE 11TH IVMSP WORKSHOP: 3D IMAGE/VIDEO TECHNOLOGIES AND APPLICATIONS (IVMSP 2013), 2013,
  • [37] MOTION VECTOR REFINEMENT FOR FRAME RATE UP CONVERSION ON 3D VIDEO
    Liu, Yutao
    Fan, Xiaopeng
    Gao, Xinwei
    Liu, Yan
    Zhao, Debin
    2013 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP 2013), 2013,
  • [38] Ada3D: Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection
    Zhao, Tianchen
    Ning, Xuefei
    Hong, Ke
    Qiu, Zhongyuan
    Lu, Pu
    Zhao, Yali
    Zhang, Linfeng
    Zhou, Lipu
    Dai, Guohao
    Yang, Huazhong
    Wang, Yu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17682 - 17692
  • [39] Efficient 2D to 3D video conversion implemented on DSP
    Ramos-Diaz, Eduardo
    Kravchenko, Victor
    Ponomaryov, Volodymyr
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011, : 1 - 9
  • [40] Efficient 2D to 3D video conversion implemented on DSP
    Eduardo Ramos-Diaz
    Victor Kravchenko
    Volodymyr Ponomaryov
    EURASIP Journal on Advances in Signal Processing, 2011