Two-View Correspondence Learning With Local Consensus Transformer

Citations: 0
Authors
Wang, Gang [1]
Chen, Yufei [2]
Affiliations
[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
[2] Tongji Univ, Sch Comp Sci & Technol, Shanghai 201804, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Transformers; Robustness; Technological innovation; Network architecture; Image reconstruction; Image edge detection; Geometry; Computer vision; Computer architecture; Correspondence learning; feature matching; local consensus (LC); transformer;
DOI
10.1109/TNNLS.2024.3488197
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Correspondence learning is a crucial component in multiview geometry and computer vision. The presence of heavy outliers (mismatches) makes the matching problem highly challenging. In this article, we revisit the benefits of local consensus (LC) in traditional feature matching and use the concept of LC to design a trainable neural network capable of capturing the underlying correspondences. This network is named the LC transformer (LCT) and is specifically tailored for wide-baseline stereo applications. Our network architecture comprises three distinct operations. First, a dynamic graph-based embedding layer establishes the neighbor topology. These local topologies then guide the multihead self-attention layer, enabling it to extract broader context through channel attention (CA). Finally, order-aware graph pooling extracts the global context information from the embedded LC. In the experimental analysis, the ablation study shows that PointNet-like learning models can, indeed, benefit from the incorporation of LC. The proposed model achieves state-of-the-art performance on two challenging settings, namely, the YFCC100M outdoor and SUN3D indoor environments, even in the presence of more than 90% outliers.
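The neighbor topology mentioned in the abstract can be illustrated with a plain k-nearest-neighbor graph over putative correspondences. The sketch below is not the authors' embedding layer: the function name `knn_graph`, the 4-D representation `(x1, y1, x2, y2)`, the sample coordinates, and the choice of Euclidean distance are all assumptions made for illustration. In a dynamic graph the neighbors would typically be recomputed in learned feature space rather than on raw coordinates.

```python
import math


def knn_graph(points, k):
    """Return, for each point, the indices of its k nearest neighbors.

    Hypothetical helper: Euclidean distance, neighbors sorted nearest first,
    ties broken by the smaller index.
    """
    if not 0 < k < len(points):
        raise ValueError("k must satisfy 0 < k < len(points)")
    neighbors = []
    for i, p in enumerate(points):
        # Distance from p to every other point, paired with that point's index.
        dists = sorted(
            (math.dist(p, q), j) for j, q in enumerate(points) if j != i
        )
        neighbors.append([j for _, j in dists[:k]])
    return neighbors


# Made-up putative correspondences (x1, y1, x2, y2) in normalized coordinates.
correspondences = [
    (0.10, 0.10, 0.12, 0.11),
    (0.11, 0.12, 0.13, 0.13),
    (0.13, 0.10, 0.15, 0.10),
    (0.80, 0.75, 0.10, 0.90),  # lies far from the others in the joint space
]
graph = knn_graph(correspondences, k=2)
```

In this toy input the first three correspondences select each other as neighbors, while the fourth is never selected by any of them, which is the kind of local agreement that a consensus-based method can exploit.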
Pages: 14
Related papers
50 in total
  • [21] CGR-Net: Consistency Guided ResFormer for Two-View Correspondence Learning
    Yang, Changcai
    Li, Xiaojie
    Ma, Jiayi
    Zhuang, Fengyuan
    Wei, Lifang
    Chen, Riqing
    Chen, Guodong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12450 - 12465
  • [22] PGFNet: Preference-Guided Filtering Network for Two-View Correspondence Learning
    Liu, Xin
    Xiao, Guobao
    Chen, Riqing
    Ma, Jiayi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1367 - 1378
  • [23] Learning Two-View Correspondences and Geometry via Local Neighborhood Correlation
    Dai, Luanyuan
    Liu, Xin
    Wang, Jingtao
    Yang, Changcai
    Chen, Riqing
    ENTROPY, 2021, 23 (08)
  • [24] Multi-Stage Network With Geometric Semantic Attention for Two-View Correspondence Learning
    Lin, Shuyuan
    Chen, Xiao
    Xiao, Guobao
    Wang, Hanzi
    Huang, Feiran
    Weng, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 3031 - 3046
  • [25] ACMatch: Improving context capture for two-view correspondence learning via adaptive convolution
    Fang, Xiang
    Lu, Yifan
    Zhang, Shihua
    Xie, Yining
    Ma, Jiayi
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2024, 218 : 466 - 480
  • [26] SGA-Net: A Sparse Graph Attention Network for Two-View Correspondence Learning
    Liao, Tangfei
    Zhang, Xiaoqin
    Xu, Yuewang
    Shi, Ziwei
    Xiao, Guobao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7578 - 7590
  • [27] Two-view correspondence learning using graph neural network with reciprocal neighbor attention
    Li, Zizhuo
    Ma, Yong
    Mei, Xiaoguang
    Ma, Jiayi
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 202 : 114 - 124
  • [28] Learning Two-View Stereo Matching
    Xiao, Jianxiong
    Chen, Jingni
    Yeung, Dit-Yan
    Quan, Long
    COMPUTER VISION - ECCV 2008, PT III, PROCEEDINGS, 2008, 5304 : 15 - 27
  • [29] Point2CN: Progressive two-view correspondence learning via information fusion
    Liu, Xin
    Xiao, Guobao
    Li, Zuoyong
    Chen, Riqing
    SIGNAL PROCESSING, 2021, 189 (189)
  • [30] T-Net: Effective Permutation-Equivariant Network for Two-View Correspondence Learning
    Zhong, Zhen
    Xiao, Guobao
    Zheng, Linxin
    Lu, Yan
    Ma, Jiayi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021 : 1930 - 1939