VPCFormer: A transformer-based multi-view finger vein recognition model and a new benchmark

被引：7

作者：

Zhao, Pengyang ^{[1
,2
]}

Song, Yizhuo ^{[2
]}

Wang, Siqi ^{[2
]}

Xue, Jing-Hao ^{[3
]}

Zhao, Shuping ^{[4
]}

Liao, Qingmin ^{[1
,2
]}

Yang, Wenming ^{[1
,2
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China

[2] Tsinghua Univ, Shenzhen Int Grad Sch, Beijing, Peoples R China

[3] UCL, Dept Stat Sci, London, England

[4] Guangdong Univ Technol, Sch Comp Sci, Guangzhou, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 148卷

关键词：

Database; Multi-view finger vein recognition; Transformer; Attention mechanism;

D O I：

10.1016/j.patcog.2023.110170

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the past decade, finger vein authentication garners significant interest. However, most existing databases and algorithms predominantly focused on single-view finger vein recognition. The current projection of vein patterns actually maps a 3D network topology into a 2D plane, which inevitably leads to 3D feature loss and topological ambiguity in 2D images. Additionally, single-view based methods are sensitive to finger rotation and translation in practical applications. So far, there are currently few dedicated studies and public databases on multi-view finger vein recognition. To address these issues, we first establish a benchmark for future research by constructing the multi-view finger vein database, named Tsinghua Multi-View Finger Vein-3 Views (THUMVFV-3V) Database , which is collected over two sessions. THUMVFV-3V provides three types of Regions of Interest (ROIs) and includes unified preprocessing operations, catering to the majority of existing methods. Furthermore, we propose a novel Transformer-based model named Vein Pattern Constrained Transformer (VPCFormer) for multi-view finger vein recognition, primarily composed of multiple Vein Pattern Constrained Encoders (VPC-Encoders) and Neighborhood-Perspective Modules (NPMs). Specifically, the VPC-Encoder incorporates a novel Vein Pattern Attention Module (VPAM) and an Integrative Feed-Forward Network (IFFN). Motivated by the fact that the strong correlations veins exhibit across different views, we devise the VPAM. Assisted by a vein mask, VPAM is meticulously designed to exclusively extract intra-and inter-view dependencies between vein patterns. Further, we propose IFFN to efficiently aggregate the preceding attention and contextual information of VPAM. In addition, the NPM is utilized to capture the correlations within a single view, enhancing the final multi-view finger vein representation. Extensive experiments demonstrate the superiority of our VPCFormer. The THUMVFV-3V database is available at https://github.com/Pengyang233/ THUMVFV-3V-Database.

引用

页数：14

共 50 条

[21] RM-Transformer: A Transformer-based Model for Mandarin Speech Recognition
Lu, Xingyu
Hu, Jianguo
Li, Shenhao
Ding, Yanyu
2022 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE (CCAI 2022), 2022, : 194 - 198
[22] Finger vein recognition based on multi-instance
Yang, Ying
Yang, Gongping
Wang, Shibing
International Journal of Digital Content Technology and its Applications, 2012, 6 (11) : 86 - 94
[23] Multi-Level Transformer-Based Social Relation Recognition
Wang, Yuchen
Qing, Linbo
Wang, Zhengyong
Cheng, Yongqiang
Peng, Yonghong
SENSORS, 2022, 22 (15)
[24] Research on Finger Vein Recognition Algorithm Based on Wavelet-Transformer
Yang, Shuqiang
Wang, Zhaodi
Qin, Huafeng
Liu, Yike
Wang, Junqiang
Electronics Letters, 2025, 61 (01)
[25] Head nod and shake recognition based on multi-view model and Hidden Markov Model
Lu, P
Zhang, MD
Zhu, XS
Wang, YS
COMPUTER GRAPHICS, IMAGING AND VISION: NEW TRENDS, 2005, : 61 - 64
[26] Transformer Based Multi-view Network for Mammographic Image Classification
Sun, Zizhao
Jiang, Huiqin
Ma, Ling
Yu, Zhan
Xu, Hongwei
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT III, 2022, 13433 : 46 - 54
[27] MMRAN: A novel model for finger vein recognition based on a residual attention mechanismMMRAN: A novel finger vein recognition model
Weiye Liu
Huimin Lu
Yifan Wang
Yupeng Li
Zhenshen Qu
Yang Li
Applied Intelligence, 2023, 53 : 3273 - 3290
[28] Multi-view convolutional vision transformer for 3D object recognition
Li, Jie
Liu, Zhao
Li, Li
Lin, Junqin
Yao, Jian
Tu, Jingmin
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
[29] Face Recognition Based on Multi-view Ensemble Learning
Shi, Wenhui
Jiang, Mingyan
PATTERN RECOGNITION AND COMPUTER VISION, PT III, 2018, 11258 : 127 - 136
[30] EMOTION RECOGNITION BASED ON MULTI-VIEW BODY GESTURES
Shen, Zhijuan
Cheng, Jun
Hu, Xiping
Dong, Qian
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3317 - 3321

← 1 2 3 4 5 →