A Flexible Multi-view Multi-modal Imaging System for Outdoor Scenes

被引:0
|
作者
Zhang, Meng [1 ]
Guo, Wenxuan [1 ]
Fan, Bohao [1 ]
Chen, Yifan [1 ]
Feng, Jianjiang [1 ]
Zhou, Jie [1 ]
机构
[1] Tsinghua Univ, BNRist, Dept Automat, Beijing, Peoples R China
关键词
3D; DATASET;
D O I
10.1109/3DV57658.2022.00044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view imaging systems enable uniform coverage of 3D space and reduce the impact of occlusion, which is beneficial for 3D object detection and tracking accuracy. However, existing imaging systems built with multi-view cameras or depth sensors are limited by the small applicable scene and complicated composition. In this paper, we propose a wireless multi-view multi-modal 3D imaging system generally applicable to large outdoor scenes, which consists of a master node and several slave nodes. Multiple spatially distributed slave nodes equipped with cameras and LiDARs are connected to form a wireless sensor network. While providing flexibility and scalability, the system applies automatic spatio-temporal calibration techniques to obtain accurate 3D multi-view multi-modal data. This system is the first imaging system that integrates mutli-view RGB cameras and LiDARs in large outdoor scenes among existing 3D imaging systems. We perform point clouds based 3D object detection and long-term tracking using the 3D imaging dataset collected by this system. The experimental results show that multi-view point clouds greatly improve 3D object detection and tracking accuracy regardless of complex and various outdoor environments.
引用
收藏
页码:322 / 331
页数:10
相关论文
共 50 条
  • [21] A MULTI-MODAL VIEW OF MEMORY
    HERRMANN, DJ
    SEARLEMAN, A
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1988, 26 (06) : 503 - 503
  • [22] Multi-View Multi-Modal Head-Gaze Estimation for Advanced Indoor User Interaction
    Kim, Jung-Hwa
    Jeong, Jin-Woo
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (03): : 5107 - 5132
  • [23] MMNeRF: Multi-Modal and Multi-View Optimized Cross-Scene Neural Radiance Fields
    Zhang, Qi
    Wang, Bo Han
    Yang, Ming Chuan
    Zou, Hang
    IEEE ACCESS, 2023, 11 : 27401 - 27413
  • [24] Graph-Based Multi-Modal Multi-View Fusion for Facial Action Unit Recognition
    Chen, Jianrong
    Dey, Sujit
    IEEE ACCESS, 2024, 12 : 69310 - 69324
  • [25] MULTI-VIEW AND MULTI-MODAL EVENT DETECTION UTILIZING TRANSFORMER-BASED MULTI-SENSOR FUSION
    Yasuda, Masahiro
    Ohishi, Yasunori
    Saito, Shoichiro
    Harado, Noboru
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4638 - 4642
  • [26] Monocular multi-view stereo imaging system
    Jiang, W.
    Shimizu, M.
    Okutomi, M.
    JOURNAL OF THE EUROPEAN OPTICAL SOCIETY-RAPID PUBLICATIONS, 2011, 6 : 10
  • [27] Multi-modal Sensor Module for Outdoor Robots
    Uhm, Taeyoung
    Park, Jeongwoo
    Bae, Gi-Doek
    Lee, Jong-Deuk
    Choi, Young-Ho
    2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2020, : 635 - 636
  • [28] A multi-modal scanning system to digitize CBRNE emergency response scenes
    Salathe, Marco
    Quiter, Brian J.
    Bandstra, Mark S.
    Chen, Xin
    Negut, Victor
    Folsom, Micah
    Weber, Gunther H.
    Greulich, Christopher
    Swinney, Mathew
    Prins, Nicholas
    Archer, Daniel E.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2022, : 74 - 79
  • [29] Collaborative recommendation model based on multi-modal multi-view attention network: Movie and literature cases
    Hu, Zheng
    Cai, Shi-Min
    Wang, Jun
    Zhou, Tao
    APPLIED SOFT COMPUTING, 2023, 144
  • [30] M3V: Multi-modal Multi-view Context Embedding for Repair Operator Prediction
    Xu, Xuezheng
    Wang, Xudong
    Xue, Jingling
    CGO 2022 - Proceedings of the 2022 IEEE/ACM International Symposium on Code Generation and Optimization, 2022, : 266 - 277