Triangulation Learning Network: from Monocular to Stereo 3D Object Detection

被引:84
|
作者
Qin, Zengyi [1 ]
Wang, Jinglu [2 ]
Lu, Yan [2 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Microsoft Res, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR.2019.00780
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study the problem of 3D object detection from stereo images, in which the key challenge is how to effectively utilize stereo information. Different from previous methods using pixel-level depth maps, we propose employing 3D anchors to explicitly construct object-level correspondences between the regions of interest in stereo images, from which the deep neural network learns to detect and triangulate the targeted object in 3D space. We also introduce a cost-efficient channel reweighting strategy that enhances representational features and weakens noisy signals to facilitate the learning process. All of these are flexibly integrated into a solid baseline detector that uses monocular images. We demonstrate that both the monocular baseline and the stereo triangulation learning network outperform the prior state-of-the-arts in 3D object detection and localization on the challenging KITTI dataset.
引用
收藏
页码:7607 / 7615
页数:9
相关论文
共 50 条
  • [41] Monocular Object Detection Using 3D Geometric Primitives
    Carr, Peter
    Sheikh, Yaser
    Matthews, Iain
    COMPUTER VISION - ECCV 2012, PT I, 2012, 7572 : 864 - 878
  • [42] Dense-JANet for Monocular 3D Object Detection
    Shang, Xiaoqing
    Cheng, Zhiwei
    Shi, Su
    Cheng, Zhuanghao
    Huang, Hongcheng
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [43] MonoCD: Monocular 3D Object Detection with Complementary Depths
    Yan, Longfei
    Yan, Pei
    Xiong, Shengzhou
    Xiang, Xuanyu
    Tan, Yihua
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10248 - 10257
  • [44] Monocular 3D object detection for an indoor robot environment
    Kim, Jiwon
    Lee, GiJae
    Kim, Jun-Sik
    Kim, Hyunwoo J.
    Kim, KangGeon
    2020 29TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2020, : 438 - 445
  • [45] Competition for roadside camera monocular 3D object detection
    Jia, Jinrang
    Shi, Yifeng
    Qu, Yuli
    Wang, Rui
    Xu, Xing
    Zhang, Hai
    NATIONAL SCIENCE REVIEW, 2023, 10 (06)
  • [46] Objects are Different: Flexible Monocular 3D Object Detection
    Zhang, Yunpeng
    Lu, Jiwen
    Zhou, Jie
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3288 - 3297
  • [47] Monocular 3D object detection for construction scene analysis
    Shen, Jie
    Jiao, Lang
    Zhang, Cong
    Peng, Keran
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (09) : 1370 - 1389
  • [48] Delving into Localization Errors for Monocular 3D Object Detection
    Ma, Xinzhu
    Zhang, Yinmin
    Xu, Dan
    Zhou, Dongzhan
    Yi, Shuai
    Li, Haojie
    Ouyang, Wanli
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4719 - 4728
  • [49] Shape-Aware Monocular 3D Object Detection
    Chen, Wei
    Zhao, Jie
    Zhao, Wan-Lei
    Wu, Song-Yuan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6416 - 6424
  • [50] Competition for roadside camera monocular 3D object detection
    Jinrang Jia
    Yifeng Shi
    Yuli Qu
    Rui Wang
    Xing Xu
    Hai Zhang
    NationalScienceReview, 2023, 10 (06) : 34 - 37