Speech-based cognitive load monitoring system

被引:55
|
作者
Yin, Bo [1 ]
Chen, Fang [1 ]
Ruiz, Natalie [1 ]
Ambikairajah, Eliathamby [1 ]
机构
[1] NICTA, Eveleigh 1430, Australia
关键词
cognitive load; speech classification;
D O I
10.1109/ICASSP.2008.4518041
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Monitoring cognitive load is important for the prevention of faulty errors in task-critical operations, and the development of adaptive user interfaces, to maintain productivity and efficiency in work performance. Speech, as an objective and non-intrusive measure, is a suitable method for monitoring cognitive load. Existing approaches for cognitive load monitoring are limited in speaker-dependent recognition and need manually labeled data. We propose a novel automatic, speaker-independent classification approach to monitor, in real-time, the person's cognitive load level by using speech features. In this approach, a Gaussian Mixture Model (GMM) based classifier is created with unsupervised training. Channel and speaker normalization are deployed for improving robustness. Different delta techniques are investigated for capturing temporal information. And a background model is introduced to reduce the impact of insufficient training data. The final system achieves 71.1% and 77.5% accuracy on two different tasks, each of which has three discrete cognitive load levels. This performance shows a great potential in real-world applications.
引用
收藏
页码:2041 / 2044
页数:4
相关论文
共 50 条
  • [31] Quester: A Speech-Based Question Answering Support System for Oral Presentations
    Asadi, Reza
    Trinh, Ha
    Fell, Harriet J.
    Bickmore, Timothy W.
    IUI 2018: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2018, : 583 - 594
  • [32] Robust Speech-Based Happiness Recognition
    Lin, Chang-Hong
    Siahaan, Ernestasia
    Chin, Yu-Hau
    Chen, Bo-Wei
    Wang, Jia-Ching
    Wang, Jhing-Fa
    1ST INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT 2013), 2013, : 227 - 230
  • [33] A Speech-Based AI for Political Participation
    Braeuer, Paula
    Mazarakis, Athanasios
    MUC 2022: PROCEEDINGS OF MENSCH UND COMPUTER 2022, 2022, : 462 - 466
  • [34] Unsupervised Filterbank Learning for Speech-based Access System for Agricultural Commodity
    Sailor, Hardik B.
    Patil, Hemant A.
    Rajpal, Avni
    2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2017, : 210 - 215
  • [35] "Assistance-On-Demand": a Speech-Based Assistance System for Urban Intersections
    Schoemig, Nadja
    Maag, Christian
    Heckmann, Martin
    Neukum, Alexandra
    Wersing, Heiko
    AUTOMOTIVEUI 2016: 8TH INTERNATIONAL CONFERENCE ON AUTOMOTIVE USER INTERFACES AND INTERACTIVE VEHICULAR APPLICATIONS, 2016, : 51 - 56
  • [36] A review of speech-based bimodal recognition
    Chibelushi, CC
    Deravi, F
    Mason, JSD
    IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (01) : 23 - 37
  • [37] Usability engineering of speech-based services
    Sidhu, Charanjit K.
    Coyle, Gerry
    British Telecommunications Engineering, 1996, 14 (pt 4): : 337 - 340
  • [38] Deploying a speech-based information system as a research platform for speech recognition research in real environments
    Nishimura, R
    Nishihara, Y
    Tsurumi, R
    Lee, A
    Saruwatari, H
    Shikano, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2005, 88 (12): : 43 - 54
  • [39] Speech-based interaction with in-vehicle computers: The effect of speech-based e-mail on drivers' attention to the roadway
    Lee, JD
    Caven, B
    Haake, S
    Brown, TL
    HUMAN FACTORS, 2001, 43 (04) : 631 - 640
  • [40] Employing Bottleneck and Convolutional Features for Speech-Based Physical Load Detection on Limited Data Amounts
    Egorow, Olga
    Mrech, Tarik
    Weisskirchen, Norman
    Wendemuth, Andreas
    INTERSPEECH 2019, 2019, : 1666 - 1670