TransMatch: Transformer-based correspondence pruning via local and global consensus

被引:0
|
作者
Liu, Yizhang [1 ,2 ]
Li, Yanping [3 ]
Zhao, Shengjie [1 ,4 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai 201804, Peoples R China
[2] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou, Peoples R China
[3] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
[4] Minist Educ, Engn Res Ctr, Key Software Technol Smart City Percept & Planning, Shanghai 200003, Peoples R China
基金
中国国家自然科学基金;
关键词
Correspondence pruning; Transformer; Local and global consensus; Camera pose estimation;
D O I
10.1016/j.patcog.2024.111120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Correspondence pruning aims to filter out false correspondences (a.k.a. outliers) from the initial feature correspondence set, which is pivotal to matching-based vision tasks, such as image registration. To solve this problem, most existing learning-based methods typically use a multilayer perceptron framework and several well-designed modules to capture local and global contexts. However, few studies have explored how local and global consensuses interact to form cohesive feature representations. This paper proposes a novel framework called TransMatch, which leverages the full power of Transformer structure to extract richer features and facilitate progressive local and global consensus learning. In addition to enhancing feature learning, Transformer is used as a powerful tool to connect the above two consensuses. Benefiting from Transformer, our TransMatch is surprisingly effective for differentiating correspondences. Experimental results on correspondence pruning and camera pose estimation demonstrate that the proposed TransMatch outperforms other state-of-the-art methods by a large margin. The code will be available at https://github. com/lyz8023lyp/TransMatch/.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] ShapeFormer: Transformer-based Shape Completion via Sparse Representation
    Yan, Xingguang
    Lin, Liqiang
    Mitra, Niloy J.
    Lischinski, Dani
    Cohen-Or, Daniel
    Huang, Hui
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6229 - 6239
  • [22] Accelerating Transformer-based Deep Learning Models on FPGAs using Column Balanced Block Pruning
    Peng, Hongwu
    Huang, Shaoyi
    Geng, Tong
    Li, Ang
    Jiang, Weiwen
    Liu, Hang
    Wang, Shusen
    Ding, Caiwen
    PROCEEDINGS OF THE 2021 TWENTY SECOND INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2021), 2021, : 142 - 148
  • [23] FTGM: Fast Transformer-Based Global Matching for Particle Image Velocimetry
    Ding, Shuaimin
    Zhao, Tianqing
    Yang, Jun
    Zhang, Dezhi
    APPLIED SCIENCES-BASEL, 2025, 15 (03):
  • [24] Transformer-Based Zero-Shot Detection via Contrastive Learning
    Liu, Wei
    Chen, Hui
    Ma, Yongqiang
    Wang, Jianji
    Zheng, Nanning
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2022, PART I, 2022, 646 : 316 - 327
  • [25] Video Review Analysis via Transformer-based Sentiment Change Detection
    Wu, Zilong
    Huang, Siyuan
    Zhang, Rui
    Li, Lin
    THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 330 - 335
  • [26] Improving Conversational Recommender Systems via Transformer-based Sequential Modelling
    Zou, Jie
    Kanoulas, Evangelos
    Ren, Pengjie
    Ren, Zhaochun
    Sun, Aixin
    Long, Cheng
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 2319 - 2324
  • [27] Hierarchical Image Generation via Transformer-Based Sequential Patch Selection
    Xu, Xiaogang
    Xu, Ning
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2938 - 2945
  • [28] MatteFormer: Transformer-Based Image Matting via Prior-Tokens
    Park, GyuTae
    Son, SungJoon
    Yoo, JaeYoung
    Kim, SeHo
    Kwak, Nojun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11686 - 11696
  • [29] Transformer-Based Contrastive Multi-view Clustering via Ensembles
    Zhao, Mingyu
    Yang, Weidong
    Nie, Feiping
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT I, 2023, 14169 : 678 - 694
  • [30] ETR: Enhancing Taillight Recognition via Transformer-Based Video Classification
    Zhou, Jiakai
    Yang, Jun
    Wu, Xiaoliang
    Zhou, Wanlin
    Wang, Yang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (02) : 2721 - 2733