MSSTNet: Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention

被引:0
|
作者
Changchen ZHAO [1 ,2 ]
Hongsheng WANG [1 ]
Yuanjing FENG [1 ]
机构
[1] College of Information Engineering, Zhejiang University of Technology
[2] Hangzhou Innovation Institute, Beihang University
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TN911.7 [信号处理]; TP391.41 []; R318 [生物医学工程];
学科分类号
0711 ; 080401 ; 080402 ; 080203 ; 0831 ;
摘要
Background The use of remote photoplethysmography(rPPG) to estimate blood volume pulse in a noncontact manner has been an active research topic in recent years. Existing methods are primarily based on a singlescale region of interest(ROI). However, some noise signals that are not easily separated in a single-scale space can be easily separated in a multi-scale space. Also, existing spatiotemporal networks mainly focus on local spatiotemporal information and do not emphasize temporal information, which is crucial in pulse extraction problems,resulting in insufficient spatiotemporal feature modelling. Methods Here, we propose a multi-scale facial video pulse extraction network based on separable spatiotemporal convolution(SSTC) and dimension separable attention(DSAT). First, to solve the problem of a single-scale ROI, we constructed a multi-scale feature space for initial signal separation. Second, SSTC and DSAT were designed for efficient spatiotemporal correlation modeling, which increased the information interaction between the long-span time and space dimensions; this placed more emphasis on temporal features. Results The signal-to-noise ratio(SNR) of the proposed network reached 9.58dB on the PURE dataset and 6.77dB on the UBFC-rPPG dataset, outperforming state-of-the-art algorithms. Conclusions The results showed that fusing multi-scale signals yielded better results than methods based on only single-scale signals.The proposed SSTC and dimension-separable attention mechanism will contribute to more accurate pulse signal extraction.
引用
收藏
页码:124 / 141
页数:18
相关论文
共 50 条
  • [1] MSSTNet: Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention
    Zhao, Changchen
    Wang, Hongsheng
    Feng, Yuanjing
    Virtual Reality and Intelligent Hardware, 2023, 5 (02): : 124 - 141
  • [2] Multi-scale depthwise separable convolution facial expression recognition embedded in attention mechanism
    Song Y.
    Gao S.
    Zeng H.
    Xiong G.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2022, 48 (12): : 2381 - 2387
  • [3] The Multi-Scale Depth-Separable Convolution Network for Fire and Smoke Detection
    Yan, Huihui
    Cui, Zhihua
    Zhao, Haotian
    Zhang, Jingbo
    Qin, Juanjuan
    Guo, Qian
    COMBUSTION SCIENCE AND TECHNOLOGY, 2024,
  • [4] Enhancing P300 Feature Extraction through Multi-Scale Separable Convolution
    Liu, Maohua
    Ahmad, Faraz
    Hao, Jianwei
    Mehmood, Muhammad
    Ahmed, Ishfaque
    Beyette, Fred R., Jr.
    SOUTHEASTCON 2024, 2024, : 1420 - 1425
  • [5] MSSTNet: A Multi-Scale Spatiotemporal Prediction Neural Network for Precipitation Nowcasting
    Ye, Yuankang
    Gao, Feng
    Cheng, Wei
    Liu, Chang
    Zhang, Shaoqing
    REMOTE SENSING, 2023, 15 (01)
  • [6] Multi-scale Xception based depthwise separable convolution for single image super-resolution
    Muhammad, Wazir
    Aramvith, Supavadee
    Onoye, Takao
    PLOS ONE, 2021, 16 (08):
  • [7] Small object detection based on hierarchical attention mechanism and multi-scale separable detection
    Zhang, Yafeng
    Yu, Junyang
    Wang, Yuanyuan
    Tang, Shuang
    Li, Han
    Xin, Zhiyi
    Wang, Chaoyi
    Zhao, Ziming
    IET IMAGE PROCESSING, 2023, 17 (14) : 3986 - 3999
  • [8] Multi-Scale Residual Depthwise Separable Convolution for Metro Passenger Flow Prediction
    Li, Taoying
    Liu, Lu
    Li, Meng
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [9] A multi-attention and depthwise separable convolution network for medical image segmentation
    Zhou, Yuxiang
    Kang, Xin
    Ren, Fuji
    Lu, Huimin
    Nakagawa, Satoshi
    Shan, Xiao
    NEUROCOMPUTING, 2024, 564
  • [10] JAMSNet: A Remote Pulse Extraction Network Based on Joint Attention and Multi-Scale Fusion
    Zhao, Changchen
    Wang, Hongsheng
    Chen, Huiling
    Shi, Weiwei
    Feng, Yuanjing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (06) : 2783 - 2797