Mitigating imbalances in heterogeneous feature fusion for multi-class 6D pose estimation

被引：2

作者：

Wang, Huafeng ^{[1
]}

Zhang, Haodu ^{[2
]}

Liu, Wanquan ^{[2
]}

Lv, Weifeng ^{[3
]}

Gu, Xianfeng ^{[4
]}

Guo, Kexin ^{[5
]}

机构：

[1] North China Univ Technol, Sch Informat Technol, Beijing 100041, Peoples R China

[2] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou 510335, Peoples R China

[3] Beihang Univ, Sch Comp Sci, Beijing 100083, Peoples R China

[4] Dept Comp Sci, Stony Brook, NY 11794 USA

[5] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2024年 / 297卷

基金：

国家重点研发计划;

关键词：

6D pose estimation; Heterogeneous information; Feature fusion; Unequal contributions; Point cloud; OBJECT; NETWORK;

D O I：

10.1016/j.knosys.2024.111918

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most 6D pose studies often treat RGB and Depth features equally in fusion, potentially limiting model generalization, especially in multi -class tasks. This limitation arises from prevalent static map generation strategies that overlook discriminative features in downsampling sparse point clouds. Additionally, the commonly adopted direct concatenation approach in heterogeneous feature fusion often leads to an averaging effect, thereby reducing the effectiveness of each feature. To tackle these challenges, we propose an effective model for dynamic graph structure feature extraction, aimed at capturing richer features from point clouds. And we introduce an adaptive fusion method for heterogeneous features, which takes into account the unequal contributions to 6D pose estimation. Validation on benchmark datasets LineMOD and YCB-Video demonstrates its effectiveness for multi -class 6D pose estimation, surpassing existing fusion methods. Of particular significance, our method attains state-of-the-art (SOTA) results on the YCB-Video dataset.

引用

页数：13

共 50 条

[1] 6D Pose Estimation with Correlation Fusion
Cheng, Yi
Zhu, Hongyuan
Sun, Ying
Acar, Cihan
Jing, Wei
Wu, Yan
Li, Liyuan
Tan, Cheston
Lim, Joo-Hwee
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2988 - 2994
[2] Deep Fusion for Multi-Modal 6D Pose Estimation
Lin, Shifeng
Wang, Zunran
Zhang, Shenghao
Ling, Yonggen
Yang, Chenguang
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6540 - 6549
[3] SEMI-DECOUPLED 6D POSE ESTIMATION VIA MULTI-MODAL FEATURE FUSION
Zhang, Zhenhu
Cao, Xin
Jin, Li
Qin, Xueying
Tong, Ruofeng
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2610 - 2614
[4] A RGB-D feature fusion network for occluded object 6D pose estimation
Song, Yiwei
Tang, Chunhui
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (8-9) : 6309 - 6319
[5] 6D Object Pose Estimation Based on Cross-Modality Feature Fusion
Jiang, Meng
Zhang, Liming
Wang, Xiaohua
Li, Shuang
Jiao, Yijie
SENSORS, 2023, 23 (19)
[6] A Novel Depth and Color Feature Fusion Framework for 6D Object Pose Estimation
Zhou, Guangliang
Yan, Yi
Wang, Deming
Chen, Qijun
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1630 - 1639
[7] A Lightweight Two-End Feature Fusion Network for Object 6D Pose Estimation
Zuo, Ligang
Xie, Lun
Pan, Hang
Wang, Zhiliang
MACHINES, 2022, 10 (04)
[8] CMFF6D: Cross-modality multiscale feature fusion network for 6D pose estimation
Han, Zongwang
Chen, Long
Wu, Shiqing
NEUROCOMPUTING, 2025, 623
[9] MPF6D: masked pyramid fusion 6D pose estimation
Nuno Pereira
Luís A. Alexandre
Pattern Analysis and Applications, 2023, 26 (3) : 1363 - 1373
[10] MPF6D: masked pyramid fusion 6D pose estimation
Pereira, Nuno
Alexandre, Luis A.
PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 1363 - 1373

← 1 2 3 4 5 →