TransMatch: Transformer-based correspondence pruning via local and global consensus

被引:0
|
作者
Liu, Yizhang [1 ,2 ]
Li, Yanping [3 ]
Zhao, Shengjie [1 ,4 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai 201804, Peoples R China
[2] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou, Peoples R China
[3] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
[4] Minist Educ, Engn Res Ctr, Key Software Technol Smart City Percept & Planning, Shanghai 200003, Peoples R China
基金
中国国家自然科学基金;
关键词
Correspondence pruning; Transformer; Local and global consensus; Camera pose estimation;
D O I
10.1016/j.patcog.2024.111120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Correspondence pruning aims to filter out false correspondences (a.k.a. outliers) from the initial feature correspondence set, which is pivotal to matching-based vision tasks, such as image registration. To solve this problem, most existing learning-based methods typically use a multilayer perceptron framework and several well-designed modules to capture local and global contexts. However, few studies have explored how local and global consensuses interact to form cohesive feature representations. This paper proposes a novel framework called TransMatch, which leverages the full power of Transformer structure to extract richer features and facilitate progressive local and global consensus learning. In addition to enhancing feature learning, Transformer is used as a powerful tool to connect the above two consensuses. Benefiting from Transformer, our TransMatch is surprisingly effective for differentiating correspondences. Experimental results on correspondence pruning and camera pose estimation demonstrate that the proposed TransMatch outperforms other state-of-the-art methods by a large margin. The code will be available at https://github. com/lyz8023lyp/TransMatch/.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] A Vision Transformer-Based Approach to Bearing Fault Classification via Vibration Signals
    Zim, Abid Hasan
    Ashraf, Aeyan
    Iqbal, Aquib
    Malik, Asad
    Kuribayashi, Minoru
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1321 - 1326
  • [42] A Transformer-based visual object tracker via learning immediate appearance change
    Li, Yifan
    Liu, Xiaotao
    Yuan, Dian
    Wang, Jiaoying
    Wu, Peng
    Liu, Jing
    PATTERN RECOGNITION, 2024, 155
  • [43] Transformer-based material recognition via short-time contact sensing
    Liu, Zhenyang
    Shao, Yitian
    Li, Qiliang
    Su, Jingyong
    PATTERN RECOGNITION, 2025, 163
  • [44] Improving Object Grasp Performance via Transformer-Based Sparse Shape Completion
    Chen, Wenkai
    Liang, Hongzhuo
    Chen, Zhaopeng
    Sun, Fuchun
    Zhang, Jianwei
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 104 (03)
  • [45] Transformer-Based Approach Via Contrastive Learning for Zero-Shot Detection
    Liu, Wei
    Chen, Hui
    Ma, Yongqiang
    Wang, Jianji
    Zheng, Nanning
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2023, 33 (07)
  • [46] Transformer-Based Model Predictive Control: Trajectory Optimization via Sequence Modeling
    Celestini, Davide
    Gammelli, Daniele
    Guffanti, Tommaso
    D'Amico, Simone
    Capello, Elisa
    Pavone, Marco
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 9820 - 9827
  • [47] TRANSFORMER-BASED END-TO-END SPEECH RECOGNITION WITH LOCAL DENSE SYNTHESIZER ATTENTION
    Xu, Menglong
    Li, Shengqiang
    Zhang, Xiao-Lei
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5899 - 5903
  • [48] MR-Matcher: A Multirouting Transformer-Based Network for Accurate Local Feature Matching
    Jiang, Zhiqiang
    Wang, Ke
    Kong, Qingjia
    Dai, Kun
    Xie, Tao
    Qin, Zhonghao
    Li, Ruifeng
    Perner, Petra
    Zhao, Lijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [49] Aggregating Global and Local Representations via Hybrid Transformer for Video Deraining
    Mao, Deqian
    Gao, Shanshan
    Li, Zhenyu
    Dai, Honghao
    Zhang, Yunfeng
    Zhou, Yuanfeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7512 - 7522
  • [50] Local Attention Transformer-Based Full-View Finger-Vein Identification
    Qin, Huafeng
    Hu, Rongshan
    El-Yacoubi, Mounim A.
    Li, Yantao
    Gao, Xinbo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (06) : 2767 - 2782