MSDN: A Multistage Deep Network for Heart-Rate Estimation From Facial Videos

被引:5
|
作者
Zhang, Xiaobiao [1 ]
Xia, Zhaoqiang [1 ,2 ]
Dai, Jing [3 ]
Liu, Lili [1 ]
Peng, Jinye [4 ]
Feng, Xiaoyi [1 ,5 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] Innovat Ctr NPU Chongqing, Chongqing 401120, Peoples R China
[3] China Acad Launch Vehicle Technol, Beijing 100076, Peoples R China
[4] Northwest Univ, Sch Informat Sci & Technol, Xian 710127, Peoples R China
[5] Res & Dev Inst NPU Shenzhen, Shenzhen 518057, Peoples R China
关键词
Feature extraction; Estimation; Heart rate; Videos; Training; Band-pass filters; Skin; Feature extractor; heart rate (HR) estimation; interbeat interval (IBI); multistage deep network (MSDN); remote photoplethysmography (rPPG) generator; NONCONTACT; PPG; CNN;
D O I
10.1109/TIM.2023.3329095
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Noncontact heart-rate (HR) measurement is a very important trend in clinical medicine. Recently, a variety of deep networks have been applied to estimate HRs from facial videos. However, due to limited data resources and poor parameter optimization, few existing models have achieved incredible performance in complicated scenarios, such as those with illumination changes, different skin tones, and facial motion. To address these challenges, this article proposes a novel multistage deep network (MSDN) that can decentralize the learnable parameters into different stages to reduce the difficulty of learning through multiple training steps. Specifically, the proposed network consists of three stages in an end-to-end way. In the first stage, an HR-aware feature extractor uses the next convolutional neural network (ConvNeXt) embedded with a newly designed bandpass filter as its backbone to extract spatial-temporal features for determining HR changes. Moreover, pseudolabels are generated to make the features compatible with illumination, motion, and color variance. In the second stage, various modules, including singular value decomposition (SVD) pooling and enhanced difference convolution (EDC) modules, are then designed and combined with a transformer encoder to construct a feature-compressed remote photoplethysmography (rPPG) generator. In the third stage, an HR estimator with an interbeat interval (IBI) analyzer and a 1-D filter is newly designed for HR estimation. Extensive experiments are performed on three publicly available databases (i.e., VIPL-HR, COHFACE, and PURE), and the results demonstrate the effectiveness of the proposed method through ablation studies and comparison experiments with state-of-the-art (SOTA) methods.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] A deep learning framework for heart rate estimation from facial videos
    Hsu, Gee-Sern Jison
    Xie, Rui-Cang
    Ambikapathi, ArulMurugan
    Chou, Kae-Jy
    NEUROCOMPUTING, 2020, 417 : 155 - 166
  • [2] Robust heart-rate estimation from facial videos using Project_ICA
    Qi, Lin
    Yu, Huidong
    Xu, Lisheng
    Mpanda, Ramadhani Selemani
    Greenwald, Stephen E.
    PHYSIOLOGICAL MEASUREMENT, 2019, 40 (08)
  • [3] Heart Rate Estimation from Facial Videos Based on Convolutional Neural Network
    Yang, Wen
    Li, Xiaoqi
    Zhang, Bin
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 45 - 49
  • [4] Heart rate estimation network from facial videos using spatiotemporal feature image
    Jaiswal, Kokila Bharti
    Meenpal, T.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [5] Information-Enhanced Network for Noncontact Heart Rate Estimation From Facial Videos
    Liu, Lili
    Xia, Zhaoqiang
    Zhang, Xiaobiao
    Peng, Jinye
    Feng, Xiaoyi
    Zhao, Guoying
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2136 - 2150
  • [6] Heart Rate Estimation From Facial Videos for Depression Analysis
    Mustafa, Aamir
    Bhatia, Shalini
    Hayat, Munawar
    Goecke, Roland
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 498 - 503
  • [7] Robust Heart Rate Estimation With Spatial-Temporal Attention Network From Facial Videos
    Hu, Min
    Qian, Fei
    Wang, Xiaohua
    He, Lei
    Guo, Dong
    Ren, Fuji
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 639 - 647
  • [8] Heart Rate and Oxygen Level Estimation from Facial Videos Using a Hybrid Deep Learning Model
    Zheng, Yufeng
    MULTIMODAL IMAGE EXPLOITATION AND LEARNING 2024, 2024, 13033
  • [9] Supervised Learning Approach to Remote Heart Rate Estimation from Facial Videos
    Osman, Ahmed
    Turcot, Jay
    El Kaliouby, Rana
    2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 1, 2015,
  • [10] Deep face segmentation for improved heart and respiratory rate estimation from videos
    Fiedler M.-A.
    Werner P.
    Rapczyński M.
    Al-Hamadi A.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (07) : 9383 - 9402