Three-Attention Mechanisms for One-Stage 3-D Object Detection Based on LiDAR and Camera

被引:27
|
作者
Wen, Li-Hua [1 ]
Jo, Kang-Hyun [1 ]
机构
[1] Univ Ulsan, Grad Sch Elect Engn, Ulsan 44610, South Korea
关键词
Camera; LiDAR; one-stage; three attention mechanisms; 3-D object detection;
D O I
10.1109/TII.2020.3048719
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article studies one-stage 3-D object detection based on light detection and ranging (LiDAR) point clouds and red-green-blue (RGB) images that aims to boost 3-D object detection accuracy based on three attention mechanisms. Currently, most of the previous works converted LiDAR point clouds into bird's-eye-view (BEV) images, achieving a significant performance. However, they still have a problem due to partial height information (z-axis value) loss during the conversion. To eliminate this problem, the height information of the LiDAR point clouds is projected onto an RGB image and embedded into the original RGB image to generate a new image, named RGBD. This is the first attention mechanism to improve 3-D detection accuracy. Moreover, two other attention mechanisms extract more discriminative global and local features, respectively. Specifically, the global attention network is appended to a feature encoder, and the local attention network is used for the view-specific region of interest fusion. Massive experiments evaluated on the KITTI benchmark suite show that the proposed approach outperforms state-of-the-art LiDAR-Camera-based methods on the car class (easy, moderate, hard): 2-D (90.35%, 88.47%, 86.98%), 3-D (85.12%, 76.23%, 74.46%), and BEV (89.64%, 86.23%, 85.60%).
引用
收藏
页码:6655 / 6663
页数:9
相关论文
共 50 条
  • [1] A Lightweight One-Stage 3D Object Detector Based on LiDAR and Camera Sensors
    Wen, Li-Hua
    Jo, Kang-Hyun
    PROCEEDINGS OF 2021 IEEE 30TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2021,
  • [2] Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images
    Tian, Zhi
    Chu, Xiangxiang
    Wang, Xiaoming
    Wei, Xiaolin
    Shen, Chunhua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [3] BAFusion: Bidirectional Attention Fusion for 3D Object Detection Based on LiDAR and Camera
    Liu, Min
    Jia, Yuanjun
    Lyu, Youhao
    Dong, Qi
    Yang, Yanyu
    SENSORS, 2024, 24 (14)
  • [4] Spatial-Aware Learning in Feature Embedding and Classification for One-Stage 3-D Object Detection
    Wu, Yiqiang
    Xiao, Weiping
    Gao, Jiantao
    Liu, Chang
    Qin, Yu
    Peng, Yan
    Li, Xiaomao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 12
  • [5] Robust LiDAR-Camera 3-D Object Detection With Object-Level Feature Fusion
    Chen, Yongxiang
    Yan, Fuwu
    Yin, Zhishuai
    Nie, Linzhen
    Tao, Bo
    Miao, Mingze
    Zheng, Ningyu
    Zhang, Pei
    Zeng, Junyuan
    IEEE SENSORS JOURNAL, 2024, 24 (18) : 29108 - 29120
  • [6] Exploring Context Information in Camera and Lidar-Based 3-D Object Detection Using Graphs and Transformers
    Sun, Chang
    Li, Yuehua
    Xing, Yan
    Zhang, Weidong
    Ai, Yibo
    Wang, Sheng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [7] One-stage pulmonary nodule detection using 3-D DCNN with feature fusion and attention mechanism in CT image
    Huang, Yao-Sian
    Chou, Ping-Ru
    Chen, Hsin-Ming
    Chang, Yeun-Chung
    Chang, Ruey-Feng
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 220
  • [8] LiDAR-camera-system-based 3D object detection with proposal selection and grid attention pooling
    An, Pei
    Liang, Junxiong
    Ma, Jie
    APPLIED OPTICS, 2022, 61 (11) : 2998 - 3007
  • [9] Path aggregation one-stage anchor free 3D object detection
    Liu, Yanfei
    Li, Chao
    Ning, Kanglin
    Li, Yali
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 25085 - 25103
  • [10] Path aggregation one-stage anchor free 3D object detection
    Yanfei Liu
    Chao Li
    Kanglin Ning
    Yali Li
    Multimedia Tools and Applications, 2024, 83 : 25085 - 25103