A GEOMETRIC CONVOLUTIONAL NEURAL NETWORK FOR 3D OBJECT DETECTION

被引:0
|
作者
Lu, Yawen [1 ]
Guo, Qianyu [1 ,2 ]
Lu, Guoyu [1 ]
机构
[1] Rochester Inst Technol, Chester Carlson Ctr Imaging Sci, Rochester, NY 14623 USA
[2] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan, Shanxi, Peoples R China
关键词
3D Object detection; Depth estimation; Convolutional neural network;
D O I
10.1109/globalsip45357.2019.8969077
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a method for accurate 3D vehicle detection based on geometric deep neural networks. From only a single RGB image, the framework is able to recover the 3D positions and predict 3D bounding boxes. In particular, the algorithm leverages single image depth estimation and semantic segmentation to produce 3D point cloud for specific objects. By geometrically constraining the object dimensions, an accurate and stable 3D bounding box which tightly fits into the real object can be estimated. We verify the effectiveness and robustness of our method by comparing with other recent state-of-art methods on the challenging KITTI 3D benchmark dataset as well as synthetic Virtual KITTI dataset. Without requiring ground truth 3D labels, our method is able to produce competitive and robust performance in 3D scene understanding and detection.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Lightweight convolutional neural network for real-time 3D object detection in road and railway environments
    A. Mauri
    R. Khemmar
    B. Decoux
    M. Haddad
    R. Boutteau
    Journal of Real-Time Image Processing, 2022, 19 : 499 - 516
  • [22] Convolutional Neural Network for 3D Object Recognition Based on RGB-D Dataset
    Wang, Jianhua
    Lu, Jinjin
    Chen, Weihai
    Wu, Xingming
    PROCEEDINGS OF THE 2015 10TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, 2015, : 34 - 39
  • [23] RGB-D Salient Object Detection via 3D Convolutional Neural Networks
    Chen, Qian
    Liu, Ze
    Zhang, Yi
    Fu, Keren
    Zhao, Qijun
    Du, Hongwei
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1063 - 1071
  • [24] VoxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition
    Maturana, Daniel
    Scherer, Sebastian
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 922 - 928
  • [25] An Improved Multi-View Convolutional Neural Network for 3D Object Retrieval
    He, Xinwei
    Bai, Song
    Chu, Jiajia
    Bai, Xiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7917 - 7930
  • [26] Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network
    Ullah, Fath U. Min
    Ullah, Amin
    Muhammad, Khan
    Ul Haq, Ijaz
    Baik, Sung Wook
    SENSORS, 2019, 19 (11)
  • [27] Detection of deleted frames on videos using a 3D Convolutional Neural Network
    Voronin, V.
    Sizyakin, R.
    Zelensky
    Nadykto, A.
    Svirin, I.
    COUNTERTERRORISM, CRIME FIGHTING, FORENSICS, AND SURVEILLANCE TECHNOLOGIES II, 2018, 10802
  • [28] Smoke Video Detection Algorithm Based On 3D Convolutional Neural Network
    Shi, Zhen
    Sun, Rui
    Huo, Mingge
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 692 - 697
  • [29] Video Visual Relation Detection via 3D Convolutional Neural Network
    Qu, Mingcheng
    Cui, Jianxun
    Su, Tonghua
    Deng, Ganlin
    Shao, Wenkai
    IEEE ACCESS, 2022, 10 : 23748 - 23756
  • [30] Filter-Pruned 3D Convolutional Neural Network for Drowsiness Detection
    Yao, Heming
    Zhang, Wei
    Malhan, Rajesh
    Gryak, Jonathan
    Najarian, Kayvan
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 1258 - 1262