VPCFormer: A transformer-based multi-view finger vein recognition model and a new benchmark

被引:7
|
作者
Zhao, Pengyang [1 ,2 ]
Song, Yizhuo [2 ]
Wang, Siqi [2 ]
Xue, Jing-Hao [3 ]
Zhao, Shuping [4 ]
Liao, Qingmin [1 ,2 ]
Yang, Wenming [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[2] Tsinghua Univ, Shenzhen Int Grad Sch, Beijing, Peoples R China
[3] UCL, Dept Stat Sci, London, England
[4] Guangdong Univ Technol, Sch Comp Sci, Guangzhou, Peoples R China
关键词
Database; Multi-view finger vein recognition; Transformer; Attention mechanism;
D O I
10.1016/j.patcog.2023.110170
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the past decade, finger vein authentication garners significant interest. However, most existing databases and algorithms predominantly focused on single-view finger vein recognition. The current projection of vein patterns actually maps a 3D network topology into a 2D plane, which inevitably leads to 3D feature loss and topological ambiguity in 2D images. Additionally, single-view based methods are sensitive to finger rotation and translation in practical applications. So far, there are currently few dedicated studies and public databases on multi-view finger vein recognition. To address these issues, we first establish a benchmark for future research by constructing the multi-view finger vein database, named Tsinghua Multi-View Finger Vein-3 Views (THUMVFV-3V) Database , which is collected over two sessions. THUMVFV-3V provides three types of Regions of Interest (ROIs) and includes unified preprocessing operations, catering to the majority of existing methods. Furthermore, we propose a novel Transformer-based model named Vein Pattern Constrained Transformer (VPCFormer) for multi-view finger vein recognition, primarily composed of multiple Vein Pattern Constrained Encoders (VPC-Encoders) and Neighborhood-Perspective Modules (NPMs). Specifically, the VPC-Encoder incorporates a novel Vein Pattern Attention Module (VPAM) and an Integrative Feed-Forward Network (IFFN). Motivated by the fact that the strong correlations veins exhibit across different views, we devise the VPAM. Assisted by a vein mask, VPAM is meticulously designed to exclusively extract intra-and inter-view dependencies between vein patterns. Further, we propose IFFN to efficiently aggregate the preceding attention and contextual information of VPAM. In addition, the NPM is utilized to capture the correlations within a single view, enhancing the final multi-view finger vein representation. Extensive experiments demonstrate the superiority of our VPCFormer. The THUMVFV-3V database is available at https://github.com/Pengyang233/ THUMVFV-3V-Database.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Attention BLSTM-Based Temporal-Spatial Vein Transformer for Multi-View Finger-Vein Recognition
    Qin, Huafeng
    Xiong, Zhipeng
    Li, Yantao
    El-Yacoubi, Mounim A.
    Wang, Jun
    IEEE Transactions on Information Forensics and Security, 2024,
  • [2] Attention BLSTM-Based Temporal-Spatial Vein Transformer for Multi-View Finger-Vein Recognition
    Qin, Huafeng
    Xiong, Zhipeng
    Li, Yantao
    El-Yacoubi, Mounim A.
    Wang, Jun
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 9330 - 9343
  • [3] MVCformer: A transformer-based multi-view clustering method
    Zhao, Mingyu
    Yang, Weidong
    Nie, Feiping
    INFORMATION SCIENCES, 2023, 649
  • [4] Transformer-Based Contrastive Multi-view Clustering via Ensembles
    Zhao, Mingyu
    Yang, Weidong
    Nie, Feiping
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT I, 2023, 14169 : 678 - 694
  • [5] Hybrid convolutional transformer-based network model for finger vein identification
    Boudjellal, Sif Eddine
    Boukezzoula, Naceur-Eddine
    Boudjelal, Abdelwahhab
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (05)
  • [6] ViT-Cap: A Novel Vision Transformer-Based Capsule Network Model for Finger Vein Recognition
    Li, Yupeng
    Lu, Huimin
    Wang, Yifan
    Gao, Ruoran
    Zhao, Chengcheng
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [7] Local Attention Transformer-Based Full-View Finger-Vein Identification
    Qin, Huafeng
    Hu, Rongshan
    El-Yacoubi, Mounim A.
    Li, Yantao
    Gao, Xinbo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (06) : 2767 - 2782
  • [8] Multi-View Gait Recognition Based on a Siamese Vision Transformer
    Yang, Yanchen
    Yun, Lijun
    Li, Ruoyu
    Cheng, Feiyan
    Wang, Kun
    APPLIED SCIENCES-BASEL, 2023, 13 (04):
  • [9] Mammography classification with multi-view deep learning techniques: and transformer-based architectures
    Manigrasso, Francesco
    Milazzo, Rosario
    Russo, Alessandro Sebastian
    Lamberti, Fabrizio
    Strand, Fredrik
    Pagnani, Andrea
    Morra, Lia
    MEDICAL IMAGE ANALYSIS, 2025, 99
  • [10] A Transformer-based Network for Multi-view 3D Mesh Generation
    Shi, Wuzhen
    Liu, Zhijie
    Li, Yingxiang
    Wen, Yang
    Liu, Yutao
    Proceedings - 2023 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Autonomous and Trusted Vehicles, Scalable Computing and Communications, Digital Twin, Privacy Computing and Data Security, Metaverse, SmartWorld/UIC/ATC/ScalCom/DigitalTwin/PCDS/Metaverse 2023, 2023,