Multi-Label Semantic 3D Reconstruction using Voxel Blocks

被引:41
|
作者
Cherabier, Ian [1 ]
Haene, Christian [2 ]
Oswald, Martin R. [1 ]
Pollefeys, Marc [1 ,3 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
[3] Microsoft, Redmond, WA USA
关键词
D O I
10.1109/3DV.2016.68
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Techniques that jointly perform dense 3D reconstruction and semantic segmentation have recently shown very promising results. One major restriction so far is that they can often only handle a very low number of semantic labels. This is mostly due to their high memory consumption caused by the necessity to store indicator variables for every label and transition. We propose a way to reduce the memory consumption of existing methods. Our approach is based on the observation that many semantic labels are only present at very localized positions in the scene, such as cars. Therefore this label does not need to be active at every location. We exploit this observation by dividing the scene into blocks in which generally only a subset of labels is active. By determining early on in the reconstruction process which labels need to be active in which block the memory consumption can be significantly reduced. In order to recover from mistakes we propose to update the set of active labels during the iterative optimization procedure based on the current solution. We also propose a way to initialize the set of active labels using a boosted classifier. In our experimental evaluation we show the reduction of memory usage quantitatively. Eventually, we show results of joint semantic 3D reconstruction and semantic segmentation with significantly more labels than previous approaches were able to handle.
引用
收藏
页码:601 / 610
页数:10
相关论文
共 50 条
  • [1] Multi-label HD Classification in 3D Flash
    Morris, Justin
    Hao, Yilun
    Gupta, Saransh
    Ramkumar, Ranganathan
    Yu, Jeffrey
    Imani, Mohsen
    Aksanli, Baris
    Rosing, Tajana
    2020 IFIP/IEEE 28TH INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2020, : 10 - 15
  • [2] Estimating 3D hand pose using hierarchical multi-label classification
    Stenger, B.
    Thayananthan, A.
    Torr, P. H. S.
    Cipolla, R.
    IMAGE AND VISION COMPUTING, 2007, 25 (12) : 1885 - 1894
  • [3] Multi-label weak-label learning via semantic reconstruction and label correlations
    Zhao, Dawei
    Li, Hong
    Lu, Yixiang
    Sun, Dong
    Zhu, De
    Gao, Qingwei
    INFORMATION SCIENCES, 2023, 623 : 379 - 401
  • [4] 3D reconstruction of plants using probabilistic voxel carving
    Feng, Jiale
    Saadat, Mojdeh
    Jubery, Talukder
    Jignasu, Anushrut
    Balu, Aditya
    Li, Yawei
    Attigala, Lakshmi
    Schnable, Patrick S.
    Sarka, Soumik
    Ganapathysubramanian, Baskar
    Krishnamurthy, Adarsh
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 213
  • [5] Unsupervised co-segmentation for 3D shapes using iterative multi-label optimization
    Meng, Min
    Xia, Jiazhi
    Luo, Jun
    He, Ying
    COMPUTER-AIDED DESIGN, 2013, 45 (02) : 312 - 320
  • [6] TMVNet : Using Transformers for Multi-view Voxel-based 3D Reconstruction
    Peng, Kebin
    Islam, Rifatul
    Quarles, John
    Desai, Kevin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 221 - 229
  • [7] Crowdsourced Semantic Matching of Multi-Label Annotations
    Duan, Lei
    Oyama, Satoshi
    Kurihara, Masahito
    Sato, Haruhiko
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3483 - 3489
  • [8] 3d indoor point cloud semantic segmentation using image and voxel
    Yeom S.-S.
    Ha J.-E.
    Ha, Jong-Eun (jeha@seoultech.ac.kr), 1600, Institute of Control, Robotics and Systems (27): : 1000 - 1007
  • [9] Dense Semantic 3D Reconstruction
    Hane, Christian
    Zach, Christopher
    Cohen, Andrea
    Pollefeys, Marc
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (09) : 1730 - 1743
  • [10] Semantic 3D Reconstruction of Heads
    Maninchedda, Fabio
    Haene, Christian
    Jacquet, Bastien
    Delaunoy, Amael
    Pollefeys, Marc
    COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 667 - 683