Representation Learning, Scene Understanding, and Feature Fusion for Drowsiness Detection

被引:11
|
作者
Yu, Jongmin [1 ]
Park, Sangwoo [1 ]
Lee, Sangwook [2 ]
Jeon, Moongu [1 ]
机构
[1] GIST, Dept Elect Engn & Comp Sci, Gwangju, South Korea
[2] Mokwon Univ, Dept Informat Commun Engn, Daejeon, South Korea
关键词
DRIVER; CLASSIFICATION; FATIGUE;
D O I
10.1007/978-3-319-54526-4_13
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We propose a novel drowsiness detection method based on 3D-Deep Convolutional Neural Network (3D-DCNN). We design a learning architecture for the drowsiness detection, which consists of three building blocks for representation learning, scene understanding, and feature fusion. In this framework, the model generates a spatio-temporal representation from multiple consecutive frames and analyze the scene conditions which are defined as head, eye, and mouth movements. The result of analysis from the scene condition understanding model is used to auxiliary information for the drowsiness detection. Then the method subsequently generates fusion features using the spatio-temporal representation and the results of the classification of scene conditions. By using the fusion features, we show that the proposed method can boost the performance of drowsiness detection. The proposed method demonstrates with the NTHU Drowsy Driver Detection (NTHU-DDD) video dataset.
引用
收藏
页码:165 / 177
页数:13
相关论文
共 50 条
  • [41] Feature Fusion Pyramid Network for End-to-End Scene Text Detection
    Wu, Yirui
    Zhang, Lilai
    Li, Hao
    Zhang, Yunfei
    Wan, Shaohua
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (11)
  • [42] Contrastive fusion representation learning for foreground object detection
    Wang, Pei
    Wu, Junsheng
    Fang, Aiqing
    Zhu, Zhixiang
    Wang, Chenwu
    Mu, Pengyuan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [43] Fusion representation learning for foreground moving object detection
    Wang, Pei
    Wu, Junsheng
    Fang, Aiqing
    Zhu, Zhixiang
    Wang, Chenwu
    Ren, Shan
    DIGITAL SIGNAL PROCESSING, 2023, 138
  • [44] Driver Drowsiness Detection based on Multimodal using Fusion of Visual-feature and Bio-signal
    Choi, Hyung-Tak
    Back, Moon-Ki
    Lee, Kyu-Chul
    2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2018, : 1249 - 1251
  • [45] Learning deep feature fusion for traffic light detection
    Hassan, Ehtesham
    Khalil, Yasser
    Ahmad, Imtiaz
    JOURNAL OF ENGINEERING RESEARCH, 2024, 12 (01): : 100 - 106
  • [46] Learning deep feature fusion for traffic light detection
    Hassan, Ehtesham
    Khalil, Yasser
    Ahmad, Imtiaz
    JOURNAL OF ENGINEERING RESEARCH, 2023, 11 (03): : 94 - 99
  • [47] Machine Learning Models for Drowsiness Detection
    Meda, Harshit
    Ganesh, Janapareddy Mohan Padmanabha
    Sahani, Ashish
    2021 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC 2021), 2021,
  • [48] In-Place Scene Labelling and Understanding with Implicit Scene Representation
    Zhi, Shuaifeng
    Laidlow, Tristan
    Leutenegger, Stefan
    Davison, Andrew J.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15818 - 15827
  • [49] A feature representation of sketch based on fusion of sparse coding and deep learning
    Zhao P.
    Gao J.-C.
    Feng C.-C.
    Han L.
    Kongzhi yu Juece/Control and Decision, 2021, 36 (03): : 699 - 704
  • [50] Learning Multimodal Representations for Drowsiness Detection
    Qian, Kun
    Koike, Tomoya
    Nakamura, Toru
    Schuller, Bjoern
    Yamamoto, Yoshiharu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 11539 - 11548