VPCFormer: A transformer-based multi-view finger vein recognition model and a new benchmark

被引:7
|
作者
Zhao, Pengyang [1 ,2 ]
Song, Yizhuo [2 ]
Wang, Siqi [2 ]
Xue, Jing-Hao [3 ]
Zhao, Shuping [4 ]
Liao, Qingmin [1 ,2 ]
Yang, Wenming [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[2] Tsinghua Univ, Shenzhen Int Grad Sch, Beijing, Peoples R China
[3] UCL, Dept Stat Sci, London, England
[4] Guangdong Univ Technol, Sch Comp Sci, Guangzhou, Peoples R China
关键词
Database; Multi-view finger vein recognition; Transformer; Attention mechanism;
D O I
10.1016/j.patcog.2023.110170
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the past decade, finger vein authentication garners significant interest. However, most existing databases and algorithms predominantly focused on single-view finger vein recognition. The current projection of vein patterns actually maps a 3D network topology into a 2D plane, which inevitably leads to 3D feature loss and topological ambiguity in 2D images. Additionally, single-view based methods are sensitive to finger rotation and translation in practical applications. So far, there are currently few dedicated studies and public databases on multi-view finger vein recognition. To address these issues, we first establish a benchmark for future research by constructing the multi-view finger vein database, named Tsinghua Multi-View Finger Vein-3 Views (THUMVFV-3V) Database , which is collected over two sessions. THUMVFV-3V provides three types of Regions of Interest (ROIs) and includes unified preprocessing operations, catering to the majority of existing methods. Furthermore, we propose a novel Transformer-based model named Vein Pattern Constrained Transformer (VPCFormer) for multi-view finger vein recognition, primarily composed of multiple Vein Pattern Constrained Encoders (VPC-Encoders) and Neighborhood-Perspective Modules (NPMs). Specifically, the VPC-Encoder incorporates a novel Vein Pattern Attention Module (VPAM) and an Integrative Feed-Forward Network (IFFN). Motivated by the fact that the strong correlations veins exhibit across different views, we devise the VPAM. Assisted by a vein mask, VPAM is meticulously designed to exclusively extract intra-and inter-view dependencies between vein patterns. Further, we propose IFFN to efficiently aggregate the preceding attention and contextual information of VPAM. In addition, the NPM is utilized to capture the correlations within a single view, enhancing the final multi-view finger vein representation. Extensive experiments demonstrate the superiority of our VPCFormer. The THUMVFV-3V database is available at https://github.com/Pengyang233/ THUMVFV-3V-Database.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] RM-Transformer: A Transformer-based Model for Mandarin Speech Recognition
    Lu, Xingyu
    Hu, Jianguo
    Li, Shenhao
    Ding, Yanyu
    2022 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE (CCAI 2022), 2022, : 194 - 198
  • [22] Finger vein recognition based on multi-instance
    Yang, Ying
    Yang, Gongping
    Wang, Shibing
    International Journal of Digital Content Technology and its Applications, 2012, 6 (11) : 86 - 94
  • [23] Multi-Level Transformer-Based Social Relation Recognition
    Wang, Yuchen
    Qing, Linbo
    Wang, Zhengyong
    Cheng, Yongqiang
    Peng, Yonghong
    SENSORS, 2022, 22 (15)
  • [24] Research on Finger Vein Recognition Algorithm Based on Wavelet-Transformer
    Yang, Shuqiang
    Wang, Zhaodi
    Qin, Huafeng
    Liu, Yike
    Wang, Junqiang
    Electronics Letters, 2025, 61 (01)
  • [25] Head nod and shake recognition based on multi-view model and Hidden Markov Model
    Lu, P
    Zhang, MD
    Zhu, XS
    Wang, YS
    COMPUTER GRAPHICS, IMAGING AND VISION: NEW TRENDS, 2005, : 61 - 64
  • [26] Transformer Based Multi-view Network for Mammographic Image Classification
    Sun, Zizhao
    Jiang, Huiqin
    Ma, Ling
    Yu, Zhan
    Xu, Hongwei
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT III, 2022, 13433 : 46 - 54
  • [27] MMRAN: A novel model for finger vein recognition based on a residual attention mechanismMMRAN: A novel finger vein recognition model
    Weiye Liu
    Huimin Lu
    Yifan Wang
    Yupeng Li
    Zhenshen Qu
    Yang Li
    Applied Intelligence, 2023, 53 : 3273 - 3290
  • [28] Multi-view convolutional vision transformer for 3D object recognition
    Li, Jie
    Liu, Zhao
    Li, Li
    Lin, Junqin
    Yao, Jian
    Tu, Jingmin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [29] Face Recognition Based on Multi-view Ensemble Learning
    Shi, Wenhui
    Jiang, Mingyan
    PATTERN RECOGNITION AND COMPUTER VISION, PT III, 2018, 11258 : 127 - 136
  • [30] EMOTION RECOGNITION BASED ON MULTI-VIEW BODY GESTURES
    Shen, Zhijuan
    Cheng, Jun
    Hu, Xiping
    Dong, Qian
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3317 - 3321