VPCFormer: A transformer-based multi-view finger vein recognition model and a new benchmark

Cited by: 7

Authors:

Zhao, Pengyang ^{[1,2]}

Song, Yizhuo ^{[2]}

Wang, Siqi ^{[2]}

Xue, Jing-Hao ^{[3]}

Zhao, Shuping ^{[4]}

Liao, Qingmin ^{[1,2]}

Yang, Wenming ^{[1,2]}

Affiliations:

[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China

[2] Tsinghua Univ, Shenzhen Int Grad Sch, Beijing, Peoples R China

[3] UCL, Dept Stat Sci, London, England

[4] Guangdong Univ Technol, Sch Comp Sci, Guangzhou, Peoples R China

Source:

PATTERN RECOGNITION | 2024 / Vol. 148

Keywords:

Database; Multi-view finger vein recognition; Transformer; Attention mechanism;

DOI:

10.1016/j.patcog.2023.110170

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In the past decade, finger vein authentication has garnered significant interest. However, most existing databases and algorithms focus on single-view finger vein recognition. The current projection of vein patterns maps a 3D network topology onto a 2D plane, which inevitably leads to 3D feature loss and topological ambiguity in 2D images. Additionally, single-view methods are sensitive to finger rotation and translation in practical applications. So far, there are few dedicated studies and public databases on multi-view finger vein recognition. To address these issues, we first establish a benchmark for future research by constructing a multi-view finger vein database, named the Tsinghua Multi-View Finger Vein-3 Views (THUMVFV-3V) Database, which is collected over two sessions. THUMVFV-3V provides three types of Regions of Interest (ROIs) and includes unified preprocessing operations, catering to the majority of existing methods. Furthermore, we propose a novel Transformer-based model named Vein Pattern Constrained Transformer (VPCFormer) for multi-view finger vein recognition, primarily composed of multiple Vein Pattern Constrained Encoders (VPC-Encoders) and Neighborhood-Perspective Modules (NPMs). Specifically, the VPC-Encoder incorporates a novel Vein Pattern Attention Module (VPAM) and an Integrative Feed-Forward Network (IFFN). Motivated by the strong correlations that veins exhibit across different views, we devise the VPAM. Assisted by a vein mask, VPAM is designed to extract only the intra- and inter-view dependencies between vein patterns. Further, we propose IFFN to efficiently aggregate the preceding attention and contextual information of VPAM. In addition, the NPM is utilized to capture the correlations within a single view, enhancing the final multi-view finger vein representation. Extensive experiments demonstrate the superiority of our VPCFormer. The THUMVFV-3V database is available at https://github.com/Pengyang233/THUMVFV-3V-Database.
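The abstract's idea of attention constrained by a vein mask can be illustrated with a minimal sketch. This is not the authors' VPAM implementation: the function name `masked_attention`, the masking rule (a pair of tokens may attend only when both lie on a vein) and the toy inputs are all assumptions made for illustration. Tokens from all views are concatenated into one sequence, so allowed pairs cover both intra-view and inter-view dependencies.

```python
import math


def masked_attention(queries, keys, values, vein_mask):
    """Scaled dot-product attention restricted to vein-to-vein token pairs.

    queries, keys, values: lists of float vectors, one per token, with the
    tokens of all views concatenated into a single sequence.
    vein_mask: list of 0/1 flags, 1 where a token lies on a vein pattern.
    (Hypothetical helper; the masking rule is an assumption, not the paper's.)
    """
    scale = 1.0 / math.sqrt(len(queries[0]))
    out_dim = len(values[0])
    outputs = []
    for i, q in enumerate(queries):
        # A key is allowed only if both the query and the key are vein tokens.
        allowed = [j for j, m in enumerate(vein_mask) if m and vein_mask[i]]
        if not allowed:
            # Non-vein queries receive no attention output in this sketch.
            outputs.append([0.0] * out_dim)
            continue
        logits = [scale * sum(a * b for a, b in zip(q, keys[j])) for j in allowed]
        peak = max(logits)  # subtract the max for a numerically stable softmax
        weights = [math.exp(x - peak) for x in logits]
        total = sum(weights)
        out = [0.0] * out_dim
        for w, j in zip(weights, allowed):
            for d, v in enumerate(values[j]):
                out[d] += (w / total) * v
        outputs.append(out)
    return outputs


# Toy input: 3 views x 2 tokens each, flattened into 6 tokens.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5], [0.0, 0.0], [1.0, 0.5]]
values = [[1.0], [100.0], [2.0], [3.0], [100.0], [4.0]]
vein_mask = [1, 0, 1, 1, 0, 1]
out = masked_attention(tokens, tokens, values, vein_mask)
```

In this toy input the non-vein tokens carry the value 100.0, so any leakage through the mask would show up as a vein-token output far above the vein values 1.0 to 4.0.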


Pages: 14
