TransMatch: Transformer-based correspondence pruning via local and global consensus

被引:0
|
作者
Liu, Yizhang [1 ,2 ]
Li, Yanping [3 ]
Zhao, Shengjie [1 ,4 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai 201804, Peoples R China
[2] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou, Peoples R China
[3] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
[4] Minist Educ, Engn Res Ctr, Key Software Technol Smart City Percept & Planning, Shanghai 200003, Peoples R China
基金
中国国家自然科学基金;
关键词
Correspondence pruning; Transformer; Local and global consensus; Camera pose estimation;
D O I
10.1016/j.patcog.2024.111120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Correspondence pruning aims to filter out false correspondences (a.k.a. outliers) from the initial feature correspondence set, which is pivotal to matching-based vision tasks, such as image registration. To solve this problem, most existing learning-based methods typically use a multilayer perceptron framework and several well-designed modules to capture local and global contexts. However, few studies have explored how local and global consensuses interact to form cohesive feature representations. This paper proposes a novel framework called TransMatch, which leverages the full power of Transformer structure to extract richer features and facilitate progressive local and global consensus learning. In addition to enhancing feature learning, Transformer is used as a powerful tool to connect the above two consensuses. Benefiting from Transformer, our TransMatch is surprisingly effective for differentiating correspondences. Experimental results on correspondence pruning and camera pose estimation demonstrate that the proposed TransMatch outperforms other state-of-the-art methods by a large margin. The code will be available at https://github. com/lyz8023lyp/TransMatch/.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Local Consensus Transformer for Correspondence Learning
    Wang, Gang
    Chen, Yufei
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1151 - 1156
  • [2] AUTOPRUNER: Transformer-Based Call Graph Pruning
    Le-Cong, Thanh
    Kang, Hong Jin
    Truong Giang Nguyen
    Haryono, Stefanus Agus
    Lo, David
    Le, Xuan-Bach D.
    Quyet Thang Huynh
    PROCEEDINGS OF THE 30TH ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2022, 2022, : 520 - 532
  • [3] Transformer-based local-global guidance for image captioning
    Parvin, Hashem
    Naghsh-Nilchi, Ahmad Reza
    Mohammadi, Hossein Mahvash
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 223
  • [4] Local and global convolutional transformer-based motor imagery EEG classification
    Zhang, Jiayang
    Li, Kang
    Yang, Banghua
    Han, Xiaofei
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [5] Local-Global Self-Attention for Transformer-Based Object Tracking
    Chen, Langkun
    Gao, Long
    Jiang, Yan
    Li, Yunsong
    He, Gang
    Ning, Jifeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12316 - 12329
  • [6] Two-View Correspondence Learning With Local Consensus Transformer
    Wang, Gang
    Chen, Yufei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [7] TransMatch: A Transformer-Based Multilevel Dual-Stream Feature Matching Network for Unsupervised Deformable Image Registration
    Chen, Zeyuan
    Zheng, Yuanjie
    Gee, James C.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (01) : 15 - 27
  • [8] Local to Global: A Sparse Transformer-Based Small Object Detector for Remote Sensing Images
    Li, Zheng
    Wang, Yongcheng
    Feng, Hao
    Chen, Chi
    Xu, Dongdong
    Zhao, Tianqi
    Gao, Yunxiao
    Zhao, Zhikang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [9] A more reliable local-global-guided network for correspondence pruning
    Peng, Chengli
    Yang, Zhenyu
    Lu, Yiwei
    Li, Zizhuo
    Jin, Qiwen
    PATTERN RECOGNITION LETTERS, 2024, 181 : 16 - 22
  • [10] Video text tracking with transformer-based local search
    Zhou, Xingsheng
    Wang, Cheng
    Wang, Xinggang
    Liu, Wenyu
    NEUROCOMPUTING, 2024, 609