MSSTNet: Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention

被引：0

作者：

Changchen ZHAO ^{[1
,2
]}

Hongsheng WANG ^{[1
]}

Yuanjing FENG ^{[1
]}

机构：

[1] College of Information Engineering, Zhejiang University of Technology

[2] Hangzhou Innovation Institute, Beihang University

来源：

虚拟现实与智能硬件(中英文) | 2023年 / 5卷 / 02期

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TN911.7 [信号处理]; TP391.41 []; R318 [生物医学工程];

学科分类号：

0711 ; 080401 ; 080402 ; 080203 ; 0831 ;

摘要：

Background The use of remote photoplethysmography(rPPG) to estimate blood volume pulse in a noncontact manner has been an active research topic in recent years. Existing methods are primarily based on a singlescale region of interest(ROI). However, some noise signals that are not easily separated in a single-scale space can be easily separated in a multi-scale space. Also, existing spatiotemporal networks mainly focus on local spatiotemporal information and do not emphasize temporal information, which is crucial in pulse extraction problems,resulting in insufficient spatiotemporal feature modelling. Methods Here, we propose a multi-scale facial video pulse extraction network based on separable spatiotemporal convolution(SSTC) and dimension separable attention(DSAT). First, to solve the problem of a single-scale ROI, we constructed a multi-scale feature space for initial signal separation. Second, SSTC and DSAT were designed for efficient spatiotemporal correlation modeling, which increased the information interaction between the long-span time and space dimensions; this placed more emphasis on temporal features. Results The signal-to-noise ratio(SNR) of the proposed network reached 9.58dB on the PURE dataset and 6.77dB on the UBFC-rPPG dataset, outperforming state-of-the-art algorithms. Conclusions The results showed that fusing multi-scale signals yielded better results than methods based on only single-scale signals.The proposed SSTC and dimension-separable attention mechanism will contribute to more accurate pulse signal extraction.

引用

页码：124 / 141

页数：18

共 50 条

[1] MSSTNet: Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention
Zhao, Changchen
Wang, Hongsheng
Feng, Yuanjing
Virtual Reality and Intelligent Hardware, 2023, 5 (02): : 124 - 141
[2] Multi-scale depthwise separable convolution facial expression recognition embedded in attention mechanism
Song Y.
Gao S.
Zeng H.
Xiong G.
Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2022, 48 (12): : 2381 - 2387
[3] The Multi-Scale Depth-Separable Convolution Network for Fire and Smoke Detection
Yan, Huihui
Cui, Zhihua
Zhao, Haotian
Zhang, Jingbo
Qin, Juanjuan
Guo, Qian
COMBUSTION SCIENCE AND TECHNOLOGY, 2024,
[4] Enhancing P300 Feature Extraction through Multi-Scale Separable Convolution
Liu, Maohua
Ahmad, Faraz
Hao, Jianwei
Mehmood, Muhammad
Ahmed, Ishfaque
Beyette, Fred R., Jr.
SOUTHEASTCON 2024, 2024, : 1420 - 1425
[5] MSSTNet: A Multi-Scale Spatiotemporal Prediction Neural Network for Precipitation Nowcasting
Ye, Yuankang
Gao, Feng
Cheng, Wei
Liu, Chang
Zhang, Shaoqing
REMOTE SENSING, 2023, 15 (01)
[6] Multi-scale Xception based depthwise separable convolution for single image super-resolution
Muhammad, Wazir
Aramvith, Supavadee
Onoye, Takao
PLOS ONE, 2021, 16 (08):
[7] Small object detection based on hierarchical attention mechanism and multi-scale separable detection
Zhang, Yafeng
Yu, Junyang
Wang, Yuanyuan
Tang, Shuang
Li, Han
Xin, Zhiyi
Wang, Chaoyi
Zhao, Ziming
IET IMAGE PROCESSING, 2023, 17 (14) : 3986 - 3999
[8] Multi-Scale Residual Depthwise Separable Convolution for Metro Passenger Flow Prediction
Li, Taoying
Liu, Lu
Li, Meng
APPLIED SCIENCES-BASEL, 2023, 13 (20):
[9] A multi-attention and depthwise separable convolution network for medical image segmentation
Zhou, Yuxiang
Kang, Xin
Ren, Fuji
Lu, Huimin
Nakagawa, Satoshi
Shan, Xiao
NEUROCOMPUTING, 2024, 564
[10] JAMSNet: A Remote Pulse Extraction Network Based on Joint Attention and Multi-Scale Fusion
Zhao, Changchen
Wang, Hongsheng
Chen, Huiling
Shi, Weiwei
Feng, Yuanjing
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (06) : 2783 - 2797

← 1 2 3 4 5 →