Spatiotemporal Sensitive Network for Non-Contact Heart Rate Prediction from Facial Videos

被引:0
|
作者
Su, Liying [1 ]
Wang, Yitao [1 ]
Zhai, Dezhao [2 ,3 ]
Shi, Yuping [4 ]
Ding, Yinghao [2 ,3 ]
Gao, Guohua [1 ]
Li, Qinwei [4 ]
Yu, Ming [5 ]
Wu, Hang [5 ,6 ]
机构
[1] Beijing Univ Technol, Coll Mech & Energy Engn, Beijing 100124, Peoples R China
[2] Tianjin Univ Technol, Sch Mech Engn, Tianjin Key Lab Adv Mechatron Syst Design & Intell, Tianjin 300384, Peoples R China
[3] Tianjin Univ Technol, Natl Demonstrat Ctr Expt Mech & Elect Engn Educ, Tianjin 300384, Peoples R China
[4] Civil Aviat Univ China, Tianjin Key Lab Adv Signal Proc, Tianjin 300300, Peoples R China
[5] Acad Mil Sci, Syst Engn Inst, Peoples Liberat Army, Tianjin 300161, Peoples R China
[6] Nankai Univ, Sch Artificial Intelligence, Tianjin 300381, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 20期
关键词
heart rate; spatiotemporal sensitive video Swin transformer; visible and infrared facial videos; multimodal fusion; multi-level fusion;
D O I
10.3390/app14209551
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Heart rate (HR) is an important indicator reflecting the overall physical and mental health of the human body, playing a crucial role in diagnosing cardiovascular and neurological diseases. Recent research has revealed that variations in the light absorption of human skin captured through facial video over the cardiac cycle, due to changes in blood volume, can be utilized for non-contact HR estimation. However, most existing methods rely on single-modal video sources (such as RGB or NIR), which often yield suboptimal results due to noise and the limitations of a single information source. To overcome these challenges, this paper proposes a multimodal information fusion architecture named the spatiotemporal sensitive network (SS-Net) for non-contact heart rate estimation. Firstly, spatiotemporal feature maps are utilized to extract physiological signals from RGB and NIR videos effectively. Next, a spatiotemporal sensitive (SS) module is introduced to extract useful physiological signal information from both RGB and NIR spatiotemporal maps. Finally, a multi-level spatiotemporal context fusion (MLSC) module is designed to fuse and complement information between the visible light and infrared modalities. Then, different levels of fused features are refined in task-specific branches to predict both remote photoplethysmography (rPPG) signals and heart rate (HR) signals. Experiments conducted on three datasets demonstrate that the proposed SS-Net achieves superior performance compared to existing methods.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] NON-CONTACT PULSE RATE MEASUREMENT USING FACIAL VIDEOS
    Ruba, M.
    Jeyakumar, Vijay
    Gurucharan, M. K.
    Kousika, V
    Viveka, S.
    PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ADVANCES AND DEVELOPMENTS IN ELECTRICAL AND ELECTRONICS ENGINEERING (ICADEE), 2020, : 180 - 185
  • [2] Heart rate estimation network from facial videos using spatiotemporal feature image
    Jaiswal, Kokila Bharti
    Meenpal, T.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [3] Non-contact Measurement of Heart Rate Based on Facial Video
    Wang, Cheng
    Jiang, Yuling
    Cai, Zhuohao
    Lin, Lan
    2019 PHOTONICS & ELECTROMAGNETICS RESEARCH SYMPOSIUM - FALL (PIERS - FALL), 2019, : 2269 - 2275
  • [4] Non-contact measurement of the heart rate and the heart rate variability
    Kranjec, Jure
    Begus, Samo
    Gersak, Gregor
    Drnovsek, Janko
    ELEKTROTEHNISKI VESTNIK, 2013, 80 (03): : 135 - 140
  • [5] Non-contact Heart Rate Estimation from a Video Sequence
    Taralunga, Dragos Daniel
    Nicolae, Irina
    Hurezeanu, Bogdan
    Ungureanu, Mihaela
    Strungaru, Rodica
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE AND EXPOSITION ON ELECTRICAL AND POWER ENGINEERING (EPE 2016), 2016, : 348 - 351
  • [6] Driver Drowsiness Detection Based on Facial Video Non-Contact Heart Rate Measurement
    Ding, Fusheng
    Qin, Yanbin
    Zhang, Lanxiang
    Lyu, Hongming
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2025, 29 (02) : 306 - 315
  • [7] Non-Contact Psychological Stress Detection Combining Heart Rate Variability and Facial Expressions
    Kong Lingqin
    Chen Fei
    Zhao Yuejin
    Dong Liquan
    Liu Ming
    Hui Mei
    ACTA OPTICA SINICA, 2021, 41 (03)
  • [8] Study on non-contact heart rate monitor
    Nagasawa H.
    Sano M.
    1600, Japan Soc. of Med. Electronics and Biol. Engineering (58) : 590 - 591
  • [9] Non-contact heart rate and heart rate variability measurements: A review
    Kranjec, J.
    Begus, S.
    Gersak, G.
    Drnovsek, J.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2014, 13 : 102 - 112
  • [10] Heart Rate Estimation from Facial Videos Based on Convolutional Neural Network
    Yang, Wen
    Li, Xiaoqi
    Zhang, Bin
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 45 - 49