Efficient 3D Video Engine Using Frame Redundancy

Cited by: 2
Authors
Peng, Gao [1]
Pang, Bo [1]
Lu, Cewu [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
DOI
10.1109/WACV48630.2021.00384
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Traditional 3D video understanding methods process videos frame by frame. We argue that much of this computation is redundant, based on a key observation: adjacent frames in 3D videos have visually similar geometric structure. To handle the redundancy, we propose the Efficient 3D Video Engine (EVE), which aims to avoid computation on redundant points. It consists of two modules: 1) a redundancy removing module designed to detect and remove redundancy; 2) a residual learning module that extracts features on non-redundant points. As a simple plug-and-play framework, EVE can be easily incorporated into mainstream 3D models. Experiments demonstrate that EVE can significantly reduce computation without performance loss on large-scale datasets. Moreover, with similar computation, EVE outperforms the strong baseline by up to 4.1 mIoU on SemanticKITTI. The code is available at https://github.com/ecr23xx/eve.
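The idea of skipping redundant points between adjacent frames can be illustrated with a minimal sketch. The abstract does not say how EVE detects redundancy or how its residual learning module works, so everything here is an assumption made for illustration: a point is treated as redundant when it falls in a voxel that was already occupied in the previous frame, and its feature is simply copied from that frame. The names `voxel_key`, `process_frame`, `extract`, and `voxel_size` are hypothetical and are not taken from the EVE code.

```python
import math

def voxel_key(point, voxel_size):
    """Quantize an (x, y, z) point to integer voxel indices."""
    return tuple(math.floor(c / voxel_size) for c in point)

def process_frame(prev_points, prev_feats, curr_points, extract, voxel_size=0.1):
    """Compute features for curr_points, reusing features from the previous frame.

    Assumption (not from the paper): a current point is redundant if its voxel
    was occupied in the previous frame. `extract` is a hypothetical stand-in for
    an expensive per-point feature extractor; it is called only on the
    non-redundant points.
    """
    # Map each occupied voxel of the previous frame to one point index.
    prev_index = {}
    for i, p in enumerate(prev_points):
        prev_index.setdefault(voxel_key(p, voxel_size), i)

    feats = [None] * len(curr_points)
    fresh = []  # indices of non-redundant points in the current frame
    for j, p in enumerate(curr_points):
        i = prev_index.get(voxel_key(p, voxel_size))
        if i is None:
            fresh.append(j)
        else:
            feats[j] = prev_feats[i]  # reuse the cached feature

    # Run the extractor only where no cached feature is available.
    if fresh:
        new_feats = extract([curr_points[j] for j in fresh])
        for j, f in zip(fresh, new_feats):
            feats[j] = f
    return feats, fresh
```

In this sketch the saving comes from calling `extract` on a subset of each frame; how much is saved depends on how many voxels stay occupied between frames.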
Pages: 3791-3801
Page count: 11