Mitigating imbalances in heterogeneous feature fusion for multi-class 6D pose estimation

被引:2
|
作者
Wang, Huafeng [1 ]
Zhang, Haodu [2 ]
Liu, Wanquan [2 ]
Lv, Weifeng [3 ]
Gu, Xianfeng [4 ]
Guo, Kexin [5 ]
机构
[1] North China Univ Technol, Sch Informat Technol, Beijing 100041, Peoples R China
[2] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou 510335, Peoples R China
[3] Beihang Univ, Sch Comp Sci, Beijing 100083, Peoples R China
[4] Dept Comp Sci, Stony Brook, NY 11794 USA
[5] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China
基金
国家重点研发计划;
关键词
6D pose estimation; Heterogeneous information; Feature fusion; Unequal contributions; Point cloud; OBJECT; NETWORK;
D O I
10.1016/j.knosys.2024.111918
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most 6D pose studies often treat RGB and Depth features equally in fusion, potentially limiting model generalization, especially in multi -class tasks. This limitation arises from prevalent static map generation strategies that overlook discriminative features in downsampling sparse point clouds. Additionally, the commonly adopted direct concatenation approach in heterogeneous feature fusion often leads to an averaging effect, thereby reducing the effectiveness of each feature. To tackle these challenges, we propose an effective model for dynamic graph structure feature extraction, aimed at capturing richer features from point clouds. And we introduce an adaptive fusion method for heterogeneous features, which takes into account the unequal contributions to 6D pose estimation. Validation on benchmark datasets LineMOD and YCB-Video demonstrates its effectiveness for multi -class 6D pose estimation, surpassing existing fusion methods. Of particular significance, our method attains state-of-the-art (SOTA) results on the YCB-Video dataset.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] 6D Pose Estimation with Correlation Fusion
    Cheng, Yi
    Zhu, Hongyuan
    Sun, Ying
    Acar, Cihan
    Jing, Wei
    Wu, Yan
    Li, Liyuan
    Tan, Cheston
    Lim, Joo-Hwee
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2988 - 2994
  • [2] Deep Fusion for Multi-Modal 6D Pose Estimation
    Lin, Shifeng
    Wang, Zunran
    Zhang, Shenghao
    Ling, Yonggen
    Yang, Chenguang
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6540 - 6549
  • [3] SEMI-DECOUPLED 6D POSE ESTIMATION VIA MULTI-MODAL FEATURE FUSION
    Zhang, Zhenhu
    Cao, Xin
    Jin, Li
    Qin, Xueying
    Tong, Ruofeng
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2610 - 2614
  • [4] A RGB-D feature fusion network for occluded object 6D pose estimation
    Song, Yiwei
    Tang, Chunhui
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (8-9) : 6309 - 6319
  • [5] 6D Object Pose Estimation Based on Cross-Modality Feature Fusion
    Jiang, Meng
    Zhang, Liming
    Wang, Xiaohua
    Li, Shuang
    Jiao, Yijie
    SENSORS, 2023, 23 (19)
  • [6] A Novel Depth and Color Feature Fusion Framework for 6D Object Pose Estimation
    Zhou, Guangliang
    Yan, Yi
    Wang, Deming
    Chen, Qijun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1630 - 1639
  • [7] A Lightweight Two-End Feature Fusion Network for Object 6D Pose Estimation
    Zuo, Ligang
    Xie, Lun
    Pan, Hang
    Wang, Zhiliang
    MACHINES, 2022, 10 (04)
  • [8] CMFF6D: Cross-modality multiscale feature fusion network for 6D pose estimation
    Han, Zongwang
    Chen, Long
    Wu, Shiqing
    NEUROCOMPUTING, 2025, 623
  • [9] MPF6D: masked pyramid fusion 6D pose estimation
    Nuno Pereira
    Luís A. Alexandre
    Pattern Analysis and Applications, 2023, 26 (3) : 1363 - 1373
  • [10] MPF6D: masked pyramid fusion 6D pose estimation
    Pereira, Nuno
    Alexandre, Luis A.
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 1363 - 1373