M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking

被引:0
|
作者
Liu, Jiaming [1 ]
Wu, Yue [1 ]
Gong, Maoguo [1 ]
Miao, Qiguang [1 ]
Ma, Wenping [1 ]
Xu, Cai [1 ]
Qin, Can [2 ]
机构
[1] Xidian Univ, Xian, Peoples R China
[2] Northeastern Univ, Boston, MA USA
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D Single Object Tracking (SOT) stands a forefront task of computer vision, proving essential for applications like autonomous driving. Sparse and occluded data in scene point clouds introduce variations in the appearance of tracked objects, adding complexity to the task. In this research, we unveil M3SOT, a novel 3D SOT framework, which synergizes multiple input frames (template sets), multiple receptive fields (continuous contexts), and multiple solution spaces (distinct tasks) in ONE model. Remarkably, M3SOT pioneers in modeling temporality, contexts, and tasks directly from point clouds, revisiting a perspective on the key factors influencing SOT. To this end, we design a transformer-based network centered on point cloud targets in the search area, aggregating diverse contextual representations and propagating target cues by employing historical frames. As M3SOT spans varied processing perspectives, we've streamlined the network-trimming its depth and optimizing its structure-to ensure a lightweight and efficient deployment for SOT applications. We posit that, backed by practical construction, M3SOT sidesteps the need for complex frameworks and auxiliary components to deliver sterling results. Extensive experiments on benchmarks such as KITTI, nuScenes, and Waymo Open Dataset demonstrate that M3SOT achieves state-of-the-art performance at 38 FPS. Our code and models are available at https://github.com/ywu0912/TeamCode.git.
引用
收藏
页码:3630 / 3638
页数:9
相关论文
共 50 条
  • [41] FANTrack: 3D Multi-Object Tracking with Feature Association Network
    Baser, Erkan
    Balasubramanian, Venkateshwaran
    Bhattacharyya, Prarthana
    Czarnecki, Krzysztof
    2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 1426 - 1433
  • [42] Exploring Simple 3D Multi-Object Tracking for Autonomous Driving
    Luo, Chenxu
    Yang, Xiaodong
    Yuille, Alan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10468 - 10477
  • [43] Unlocking the power of multi-modal fusion in 3D object tracking
    Hu, Yue
    IET COMPUTER VISION, 2025, 19 (01)
  • [44] mmMCL3DMOT: Multi-Modal Momentum Contrastive Learning for 3D Multi-Object Tracking
    Hong, Ru
    Yang, Jiming
    Zhou, Weidian
    Da, Feipeng
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1895 - 1899
  • [45] Multi-Correlation Siamese Transformer Network With Dense Connection for 3D Single Object Tracking
    Feng, Shihao
    Liang, Pengpeng
    Gao, Jin
    Cheng, Erkang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (12) : 8066 - 8073
  • [46] Track initialization and re-identification for 3D multi-view multi-object tracking
    Van Ma, Linh
    Nguyen, Tran Thien Dat
    Vo, Ba-Ngu
    Jang, Hyunsung
    Jeon, Moongu
    INFORMATION FUSION, 2024, 111
  • [47] MSA-MOT: Multi-Stage Association for 3D Multimodality Multi-Object Tracking
    Zhu, Ziming
    Nie, Jiahao
    Wu, Han
    He, Zhiwei
    Gao, Mingyu
    SENSORS, 2022, 22 (22)
  • [48] Optimization of the 3D multi-level SOT-MRAMs
    Lin, Hui
    Jiang, Yanfeng
    AIP ADVANCES, 2024, 14 (02)
  • [49] A 3D multi-field element for simulating the electromechanical coupling behavior of dielectric elastomers
    Jun Liu
    Choon Chiang Foo
    Zhi-Qian Zhang
    Acta Mechanica Solida Sinica, 2017, 30 : 374 - 389
  • [50] 3DMOTFormer: Graph Transformer for Online 3D Multi-Object Tracking
    Ding, Shuxiao
    Rehder, Eike
    Schneider, Lukas
    Cordts, Marius
    Gall, Juergen
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9750 - 9760