M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking

被引：0

作者：

Liu, Jiaming ^{[1
]}

Wu, Yue ^{[1
]}

Gong, Maoguo ^{[1
]}

Miao, Qiguang ^{[1
]}

Ma, Wenping ^{[1
]}

Xu, Cai ^{[1
]}

Qin, Can ^{[2
]}

机构：

[1] Xidian Univ, Xian, Peoples R China

[2] Northeastern Univ, Boston, MA USA

来源：

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4 | 2024年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

3D Single Object Tracking (SOT) stands a forefront task of computer vision, proving essential for applications like autonomous driving. Sparse and occluded data in scene point clouds introduce variations in the appearance of tracked objects, adding complexity to the task. In this research, we unveil M3SOT, a novel 3D SOT framework, which synergizes multiple input frames (template sets), multiple receptive fields (continuous contexts), and multiple solution spaces (distinct tasks) in ONE model. Remarkably, M3SOT pioneers in modeling temporality, contexts, and tasks directly from point clouds, revisiting a perspective on the key factors influencing SOT. To this end, we design a transformer-based network centered on point cloud targets in the search area, aggregating diverse contextual representations and propagating target cues by employing historical frames. As M3SOT spans varied processing perspectives, we've streamlined the network-trimming its depth and optimizing its structure-to ensure a lightweight and efficient deployment for SOT applications. We posit that, backed by practical construction, M3SOT sidesteps the need for complex frameworks and auxiliary components to deliver sterling results. Extensive experiments on benchmarks such as KITTI, nuScenes, and Waymo Open Dataset demonstrate that M3SOT achieves state-of-the-art performance at 38 FPS. Our code and models are available at https://github.com/ywu0912/TeamCode.git.

引用

页码：3630 / 3638

页数：9

共 50 条

[41] FANTrack: 3D Multi-Object Tracking with Feature Association Network
Baser, Erkan
Balasubramanian, Venkateshwaran
Bhattacharyya, Prarthana
Czarnecki, Krzysztof
2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 1426 - 1433
[42] Exploring Simple 3D Multi-Object Tracking for Autonomous Driving
Luo, Chenxu
Yang, Xiaodong
Yuille, Alan
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10468 - 10477
[43] Unlocking the power of multi-modal fusion in 3D object tracking
Hu, Yue
IET COMPUTER VISION, 2025, 19 (01)
[44] mmMCL3DMOT: Multi-Modal Momentum Contrastive Learning for 3D Multi-Object Tracking
Hong, Ru
Yang, Jiming
Zhou, Weidian
Da, Feipeng
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1895 - 1899
[45] Multi-Correlation Siamese Transformer Network With Dense Connection for 3D Single Object Tracking
Feng, Shihao
Liang, Pengpeng
Gao, Jin
Cheng, Erkang
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (12) : 8066 - 8073
[46] Track initialization and re-identification for 3D multi-view multi-object tracking
Van Ma, Linh
Nguyen, Tran Thien Dat
Vo, Ba-Ngu
Jang, Hyunsung
Jeon, Moongu
INFORMATION FUSION, 2024, 111
[47] MSA-MOT: Multi-Stage Association for 3D Multimodality Multi-Object Tracking
Zhu, Ziming
Nie, Jiahao
Wu, Han
He, Zhiwei
Gao, Mingyu
SENSORS, 2022, 22 (22)
[48] Optimization of the 3D multi-level SOT-MRAMs
Lin, Hui
Jiang, Yanfeng
AIP ADVANCES, 2024, 14 (02)
[49] A 3D multi-field element for simulating the electromechanical coupling behavior of dielectric elastomers
Jun Liu
Choon Chiang Foo
Zhi-Qian Zhang
Acta Mechanica Solida Sinica, 2017, 30 : 374 - 389
[50] 3DMOTFormer: Graph Transformer for Online 3D Multi-Object Tracking
Ding, Shuxiao
Rehder, Eike
Schneider, Lukas
Cordts, Marius
Gall, Juergen
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9750 - 9760

← 1 2 3 4 5 →