Mitigating imbalances in heterogeneous feature fusion for multi-class 6D pose estimation

被引:2
|
作者
Wang, Huafeng [1 ]
Zhang, Haodu [2 ]
Liu, Wanquan [2 ]
Lv, Weifeng [3 ]
Gu, Xianfeng [4 ]
Guo, Kexin [5 ]
机构
[1] North China Univ Technol, Sch Informat Technol, Beijing 100041, Peoples R China
[2] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou 510335, Peoples R China
[3] Beihang Univ, Sch Comp Sci, Beijing 100083, Peoples R China
[4] Dept Comp Sci, Stony Brook, NY 11794 USA
[5] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China
基金
国家重点研发计划;
关键词
6D pose estimation; Heterogeneous information; Feature fusion; Unequal contributions; Point cloud; OBJECT; NETWORK;
D O I
10.1016/j.knosys.2024.111918
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most 6D pose studies often treat RGB and Depth features equally in fusion, potentially limiting model generalization, especially in multi -class tasks. This limitation arises from prevalent static map generation strategies that overlook discriminative features in downsampling sparse point clouds. Additionally, the commonly adopted direct concatenation approach in heterogeneous feature fusion often leads to an averaging effect, thereby reducing the effectiveness of each feature. To tackle these challenges, we propose an effective model for dynamic graph structure feature extraction, aimed at capturing richer features from point clouds. And we introduce an adaptive fusion method for heterogeneous features, which takes into account the unequal contributions to 6D pose estimation. Validation on benchmark datasets LineMOD and YCB-Video demonstrates its effectiveness for multi -class 6D pose estimation, surpassing existing fusion methods. Of particular significance, our method attains state-of-the-art (SOTA) results on the YCB-Video dataset.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] SaMfENet: Self-Attention Based Multi-Scale Feature Fusion Coding and Edge Information Constraint Network for 6D Pose Estimation
    Li, Zhuoxiao
    Li, Xiaobing
    Chen, Shihao
    Du, Jialong
    Li, Yong
    MATHEMATICS, 2022, 10 (19)
  • [42] Lightweight Full-Flow Bidirectional Fusion Network for 6D Pose Estimation
    Lin, Haotian
    Li, Yongchang
    Jiang, Jing
    Qin, Guangjun
    Computer Engineering and Applications, 2024, 60 (22) : 282 - 291
  • [43] FormerPose: An efficient multi-scale fusion Transformer network based on RGB-D for 6D pose estimation
    Hou, Pihong
    Zhang, Yongfang
    Wu, Yi
    Yan, Pengyu
    Zhang, Fuqiang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 106
  • [44] Single Shot 6D Object Pose Estimation
    Kleeberger, Kilian
    Huber, Marco F.
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 6239 - 6245
  • [45] BOP: Benchmark for 6D Object Pose Estimation
    Hodan, Tomas
    Michel, Frank
    Brachmann, Eric
    Kehl, Wadim
    Buch, Anders Glent
    Kraft, Dirk
    Drost, Bertram
    Vidal, Joel
    Ihrke, Stephan
    Zabulis, Xenophon
    Sahin, Caner
    Manhardt, Fabian
    Tombari, Federico
    Kim, Tae-Kyun
    Matas, Jiri
    Rother, Carsten
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 19 - 35
  • [46] Adaptive Multimodal-Feature Fusion for 6D Object Position Estimation
    Zang, Chuanfang
    Dang, Jianwu
    Yong, Jiu
    LASER & OPTOELECTRONICS PROGRESS, 2025, 62 (04)
  • [47] Survey on 6D Pose Estimation of Rigid Object
    Chen, Jiale
    Zhang, Lijun
    Liu, Yi
    Xu, Chi
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7440 - 7445
  • [48] Orientation Keypoints for 6D Human Pose Estimation
    Fisch, Martin
    Clark, Ronald
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 10145 - 10158
  • [49] PoseMatcher: One-shot 6D Object Pose Estimation by Deep Feature Matching
    Castro, Pedro
    Kim, Tae-Kyun
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2140 - 2149
  • [50] HEAD POSE ESTIMATION THROUGH MULTI-CLASS FACE SEGMENTATION
    Khan, Khalil
    Mauro, Massimo
    Migliorati, Pierangelo
    Leonardi, Riccardo
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 175 - 180