Efficient 3D Video Engine Using Frame Redundancy

被引：2

作者：

Peng, Gao ^{[1
]}

Pang, Bo ^{[1
]}

Lu, Cewu ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

来源：

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021 | 2021年

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

D O I：

10.1109/WACV48630.2021.00384

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Traditional 3d video understanding methods process videos frame by frame. We argue that a lot of computation in this mechanism is redundant based on a key observation - adjacent frames in 3D videos have visually similar geometry structure. To handle the redundancy, we propose the Efficient 3D Video Engine (EVE), aiming to avoid the computation of redundant points. It consists of two modules: 1) redundancy removing module designed to detect redundancy and remove it; 2) residual learning module to extract features on non-redundant points. As a simple plug and play framework, EVE can be easily incorporated in mainstream 3D models. Experiments demonstrate that EVE can significantly reduce computation without performance loss on large scale datasets. On the other hand, with similar computation, EVE outperforms the strong baseline by up to 4.1 mIoU on SemanticKITTI. The code is available on https://github.com/ecr23xx/eve.

引用

页码：3791 / 3801

页数：11

共 50 条

[31] Efficient representation of disoccluded regions in 3D video coding
Farid, Muhammad Shahid
Babar, Badi uz Zaman
Khan, Muhammad Hassan
ANNALS OF TELECOMMUNICATIONS, 2025, 80 (1-2) : 123 - 137
[32] Symmetrical Frame Discard Method for 3D video over IP Networks
Chung, Young-uk
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (04) : 2790 - 2796
[33] Key frame extraction in 3D video by rate-distortion optimization
Xu, Jianfeng
Yamasaki, Toshihiko
Aizawa, Kiyoharu
2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1 - +
[34] Efficient 3D wavelet transform decomposition for video compression
Moyano, E
Quiles, FJ
Garrido, A
Orozco-Barbosa, L
Duato, J
SECOND INTERNATIONAL WORKSHOP ON DIGITAL AND COMPUTATIONAL VIDEO, PROCEEDINGS, 2001, : 118 - 125
[35] Towards key-frame extraction methods for 3D video: a review
Ferreira, Lino
da Silva Cruz, Luis A.
Assuncao, Pedro
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2016,
[36] FRAME RATE UP CONVERSION OF 3D VIDEO BY MOTION AND DEPTH FUSION
Lee, Yeejin
Lee, Zucheul
Nguyen, Truong Q.
2013 IEEE 11TH IVMSP WORKSHOP: 3D IMAGE/VIDEO TECHNOLOGIES AND APPLICATIONS (IVMSP 2013), 2013,
[37] MOTION VECTOR REFINEMENT FOR FRAME RATE UP CONVERSION ON 3D VIDEO
Liu, Yutao
Fan, Xiaopeng
Gao, Xinwei
Liu, Yan
Zhao, Debin
2013 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP 2013), 2013,
[38] Ada3D: Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection
Zhao, Tianchen
Ning, Xuefei
Hong, Ke
Qiu, Zhongyuan
Lu, Pu
Zhao, Yali
Zhang, Linfeng
Zhou, Lipu
Dai, Guohao
Yang, Huazhong
Wang, Yu
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17682 - 17692
[39] Efficient 2D to 3D video conversion implemented on DSP
Ramos-Diaz, Eduardo
Kravchenko, Victor
Ponomaryov, Volodymyr
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011, : 1 - 9
[40] Efficient 2D to 3D video conversion implemented on DSP
Eduardo Ramos-Diaz
Victor Kravchenko
Volodymyr Ponomaryov
EURASIP Journal on Advances in Signal Processing, 2011

← 1 2 3 4 5 →