Heart Rate and Oxygen Level Estimation from Facial Videos Using a Hybrid Deep Learning Model

被引:1
|
作者
Zheng, Yufeng [1 ]
机构
[1] Univ Mississippi, Med Ctr, Jackson, MS 38677 USA
关键词
Vital sign; Facial video; convolutional neural network (CNN); Convolutional long short-term memory (convLSTM); Video vision transformer (ViViT); Deep learning; Telehealth; NONCONTACT; FUSION;
D O I
10.1117/12.3013956
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vital signs can be inferred from facial videos for health monitoring remotely, while facial videos can be easily obtained through phone cameras, webcams, or surveillance systems. In this study, we propose a hybrid deep learning model to estimate heart rate (HR) and blood oxygen saturation level (SpO2) from facial videos. The hybrid model has a mixed network architecture consisting of convolutional neural network (CNN), convolutional long short-term memory (convLSTM), and video vision transformer (ViViT). Temporal resolution is emphasized in feature extraction since both HR and SpO2 are varying over time. A clip of video consists of a set of frame images within a time segment. CNN is performed with regard to each frame (e.g., time distributed), convLSTM and ViViT can be configured to process a sequence of frames. These high-resolution temporal features are combined to predict HR and SpO2, which are expected to capture these signal variations. Our vital video dataset is fairly large by including 891 subjects from difference races and ages. Facial detection and data normalization are performed in preprocessing. Our experiments show that the proposed hybrid model can predict HR and SpO2 accurately. In addition, those models can be extended to infer HR fluctuations, respiratory rates, and blood pressure variations from facial videos.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Human activity recognition from uav videos using an optimized hybrid deep learning model
    Sinha, Kumari Priyanka
    Kumar, Prabhat
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 51669 - 51698
  • [22] Human activity recognition from uav videos using an optimized hybrid deep learning model
    Kumari Priyanka Sinha
    Prabhat Kumar
    Multimedia Tools and Applications, 2024, 83 : 51669 - 51698
  • [23] Robust Heart Rate Estimation With Spatial-Temporal Attention Network From Facial Videos
    Hu, Min
    Qian, Fei
    Wang, Xiaohua
    He, Lei
    Guo, Dong
    Ren, Fuji
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 639 - 647
  • [24] Heart rate estimation from facial videos with motion interference using T-SNE-based signal separation
    Wang, Hequn
    Yang, Xuezhi
    Liu, Xuenan
    Wang, Dingliang
    BIOMEDICAL OPTICS EXPRESS, 2022, 13 (09): : 4494 - 4509
  • [25] Robust Heart Rate Variability Measurement from Facial Videos
    Odinaev, Ismoil
    Wong, Kwan Long
    Chin, Jing Wei
    Goyal, Raghav
    Chan, Tsz Tai
    So, Richard H. Y.
    BIOENGINEERING-BASEL, 2023, 10 (07):
  • [26] A Two-Stream Deep-Learning Network for Heart Rate Estimation From Facial Image Sequence
    Lie, Wen-Nung
    Le, Dao Q.
    Huang, Po-Han
    Fu, Guan-Hao
    Quynh, Anh Nguyen Thi
    Nhu, Quynh Nguyen Quang
    IEEE SENSORS JOURNAL, 2024, 24 (24) : 42343 - 42351
  • [27] Contactless heart rate estimation from face videos
    Lamba, Puneet Singh
    Virmani, Deepali
    JOURNAL OF STATISTICS AND MANAGEMENT SYSTEMS, 2020, 23 (07) : 1275 - 1284
  • [28] Personalized Estimation of Engagement from Videos Using Active Learning with Deep Reinforcement Learning
    Rudovic, Ognjen
    Park, Hae Won
    Busche, John
    Schuller, Bjoern
    Breazeal, Cynthia
    Picard, Rosalind W.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 217 - 226
  • [29] Heart Rate Estimation From Facial Images Using Filter Bank
    Yu, Yong-Poh
    Raveendran, P.
    Lim, Chern-Loon
    2014 6TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING (ISCCSP), 2014, : 69 - 72
  • [30] Remote Heart Rate Measurement from Highly Compressed Facial Videos: an End-to-end Deep Learning Solution with Video Enhancement
    Yu, Zitong
    Peng, Wei
    Li, Xiaobai
    Hong, Xiaopeng
    Zhao, Guoying
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 151 - 160