Representation Learning, Scene Understanding, and Feature Fusion for Drowsiness Detection

被引:11
|
作者
Yu, Jongmin [1 ]
Park, Sangwoo [1 ]
Lee, Sangwook [2 ]
Jeon, Moongu [1 ]
机构
[1] GIST, Dept Elect Engn & Comp Sci, Gwangju, South Korea
[2] Mokwon Univ, Dept Informat Commun Engn, Daejeon, South Korea
关键词
DRIVER; CLASSIFICATION; FATIGUE;
D O I
10.1007/978-3-319-54526-4_13
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We propose a novel drowsiness detection method based on 3D-Deep Convolutional Neural Network (3D-DCNN). We design a learning architecture for the drowsiness detection, which consists of three building blocks for representation learning, scene understanding, and feature fusion. In this framework, the model generates a spatio-temporal representation from multiple consecutive frames and analyze the scene conditions which are defined as head, eye, and mouth movements. The result of analysis from the scene condition understanding model is used to auxiliary information for the drowsiness detection. Then the method subsequently generates fusion features using the spatio-temporal representation and the results of the classification of scene conditions. By using the fusion features, we show that the proposed method can boost the performance of drowsiness detection. The proposed method demonstrates with the NTHU Drowsy Driver Detection (NTHU-DDD) video dataset.
引用
收藏
页码:165 / 177
页数:13
相关论文
共 50 条
  • [1] Representation Learning for Semantic Scene Understanding
    Farshad, Azade
    HHAI 2023: AUGMENTING HUMAN INTELLECT, 2023, 368 : 445 - 458
  • [2] Feature Fusion for Scene Text Detection
    Zhu, Zhen
    Liao, Minghui
    Shi, Baoguang
    Bai, Xiang
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 193 - 198
  • [3] Driver Drowsiness Detection System Based on Feature Representation Learning Using Various Deep Networks
    Park, Sanghyuk
    Pan, Fei
    Kang, Sunghun
    Yoo, Chang D.
    COMPUTER VISION - ACCV 2016 WORKSHOPS, PT III, 2017, 10118 : 154 - 164
  • [4] FEATURE FUSION NETWORK FOR SCENE TEXT DETECTION
    Cai, Chenqin
    Lv, Pin
    Su, Bing
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2755 - 2759
  • [5] Performance Analysis of Holistic Feature Representation for Scene Understanding and Classification
    Fu Yi
    Tian Chang
    Wu Ze Min
    Zeng Ming Yong
    Hu Yinji
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 3756 - 3760
  • [6] Object-Centric Representation Learning for Video Scene Understanding
    Zhou, Yi
    Zhang, Hui
    Park, Seung-In
    Yoo, ByungIn
    Qi, Xiaojuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8410 - 8423
  • [7] Feature Selection for Driver Drowsiness Detection
    Panda, Saurav
    Kolhekar, Megha
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA ENGINEERING (ICCIDE 2018), 2019, 28 : 127 - 140
  • [8] A Confounder-Free Fusion Network for Aerial Image Scene Feature Representation
    Xiong, Wei
    Xiong, Zhenyu
    Cui, Yaqi
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 5440 - 5454
  • [9] Towards Drowsiness Driving Detection Based on Multi-Feature Fusion and LSTM Networks
    Hong, Lin
    Wang, Xin
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 732 - 736
  • [10] Binary feature representation learning for scene retrieval in micro-video
    Guo, Jie
    Nie, Xiushan
    Jian, Muwei
    Yin, Yilong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (17) : 24539 - 24552