Monocular 3D Object Detection Utilizing Auxiliary Learning With Deformable Convolution

Cited by: 2

Authors:

Chen, Jiun-Han [1]

Shieh, Jeng-Lun [1]

Haq, Muhamad Amirul [1]

Ruan, Shanq-Jang [1]

Affiliations:

[1] Natl Taiwan Univ Sci & Technol, Dept Elect & Comp Engn, Taipei 10607, Taiwan

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024 / Vol. 25 / No. 03

Keywords:

Three-dimensional displays; Object detection; Solid modeling; Feature extraction; Training; Computational modeling; Task analysis; 3D object detection; monocular camera; driving scene understanding; auxiliary learning; deep learning;

DOI:

10.1109/TITS.2023.3319556

Chinese Library Classification (CLC) number:

TU [Building Science];

Subject classification code:

0813 ;

Abstract:

In autonomous driving systems, the monocular 3D object detection algorithm is a crucial component. The safety of autonomous vehicles heavily depends on a well-designed detection system. Therefore, developing a robust and efficient 3D object detection algorithm is a major goal for institutes and researchers. A sense of 3D space is essential in autonomous vehicles and robotics, as it allows the system to understand its surroundings and react accordingly. Compared with stereo-based and LiDAR-based methods, monocular 3D object detection is a challenging task because it uses only 2D information to generate complex 3D features; in return, it is low-cost, less computationally intensive, and has great potential. However, the performance of monocular methods is impaired by the lack of depth information. In this paper, we propose a simple, end-to-end, and effective network for monocular 3D object detection without the use of external training data. Our work is inspired by auxiliary learning: we use a robust feature extractor as our backbone and multiple regression heads to learn auxiliary knowledge. These auxiliary regression heads are discarded after training for improved inference efficiency, allowing us to take advantage of auxiliary learning and enabling the model to learn critical information more conceptually. The proposed method achieves 17.28% and 20.10% for the moderate level of the Car category on the KITTI benchmark test set and validation set, respectively, which outperforms the previous monocular 3D object detection approaches.
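The train-then-discard pattern the abstract describes for auxiliary regression heads can be sketched roughly as follows. This is a minimal, framework-free illustration and not the authors' network: the class name `AuxiliaryDetector`, the head names `aux_a` and `aux_b`, the feature size, the squared-error loss, and the `aux_weight` value are all hypothetical placeholders chosen for the example.

```python
# Illustrative sketch only: all names, sizes, and weights are assumptions,
# not the architecture or losses used in the paper.
import random


def linear(weights, features):
    """Dot product of one weight vector with the feature vector."""
    return sum(w * f for w, f in zip(weights, features))


class AuxiliaryDetector:
    """Shared backbone, one main head, and removable auxiliary heads."""

    def __init__(self, feature_dim=4, aux_names=("aux_a", "aux_b"), seed=0):
        rng = random.Random(seed)

        def new_head():
            return [rng.uniform(-1.0, 1.0) for _ in range(feature_dim)]

        self.main_head = new_head()  # kept at inference time
        self.aux_heads = {name: new_head() for name in aux_names}  # training only

    def backbone(self, image):
        # Stand-in for a shared feature extractor: a fixed rescaling to [0, 1].
        return [pixel / 255.0 for pixel in image]

    def forward(self, image):
        # Every head that is still attached reads the same shared features.
        features = self.backbone(image)
        outputs = {"main": linear(self.main_head, features)}
        for name, head in self.aux_heads.items():
            outputs[name] = linear(head, features)
        return outputs

    def training_loss(self, image, targets, aux_weight=0.5):
        # Main squared error plus down-weighted auxiliary squared errors.
        outputs = self.forward(image)
        loss = (outputs["main"] - targets["main"]) ** 2
        for name in self.aux_heads:
            loss += aux_weight * (outputs[name] - targets[name]) ** 2
        return loss

    def strip_auxiliary(self):
        # Drop the auxiliary heads so inference computes only the main output.
        self.aux_heads = {}


model = AuxiliaryDetector()
image = [10, 128, 200, 255]
train_outputs = model.forward(image)   # main + auxiliary predictions
model.strip_auxiliary()
infer_outputs = model.forward(image)   # main prediction only
```

Because the auxiliary heads only read the shared features, removing them leaves the main prediction unchanged while skipping their computation at inference. In a real training loop the auxiliary losses would influence the backbone through gradient updates, which this sketch does not implement.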


Pages: 2424-2436

Page count: 13

