Speech-based cognitive load monitoring system

被引:55
|
作者
Yin, Bo [1 ]
Chen, Fang [1 ]
Ruiz, Natalie [1 ]
Ambikairajah, Eliathamby [1 ]
机构
[1] NICTA, Eveleigh 1430, Australia
关键词
cognitive load; speech classification;
D O I
10.1109/ICASSP.2008.4518041
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Monitoring cognitive load is important for the prevention of faulty errors in task-critical operations, and the development of adaptive user interfaces, to maintain productivity and efficiency in work performance. Speech, as an objective and non-intrusive measure, is a suitable method for monitoring cognitive load. Existing approaches for cognitive load monitoring are limited in speaker-dependent recognition and need manually labeled data. We propose a novel automatic, speaker-independent classification approach to monitor, in real-time, the person's cognitive load level by using speech features. In this approach, a Gaussian Mixture Model (GMM) based classifier is created with unsupervised training. Channel and speaker normalization are deployed for improving robustness. Different delta techniques are investigated for capturing temporal information. And a background model is introduced to reduce the impact of insufficient training data. The final system achieves 71.1% and 77.5% accuracy on two different tasks, each of which has three discrete cognitive load levels. This performance shows a great potential in real-world applications.
引用
收藏
页码:2041 / 2044
页数:4
相关论文
共 50 条
  • [41] Speaker normalisation for speech-based emotion detection
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathainby
    Epps, Julien
    PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 611 - +
  • [42] VOICE: a framework for speech-based mobile systems
    Sharp, Adam
    Kurkovsky, Stan
    21ST INTERNATIONAL CONFERENCE ON ADVANCED NETWORKING AND APPLICATIONS WORKSHOPS/SYMPOSIA, VOL 2, PROCEEDINGS, 2007, : 38 - +
  • [43] Speech-Based L2 Call System for English Foreign Speakers
    Ateeq, Mohammad
    Hanani, Abualsoud
    SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 43 - 53
  • [44] Automated remote speech-based testing of individuals with cognitive decline: Bayesian agreement of transcription accuracy
    Koenig, Alexandra
    Koehler, Stefanie
    Troeger, Johannes
    Duezel, Emrah
    Glanz, Wenzel
    Butryn, Michaela
    Mallick, Elisa
    Priller, Josef
    Altenstein, Slawek
    Spottke, Annika
    Kimmich, Okka
    Falkenburger, Bjoern
    Osterrath, Antje
    Wiltfang, Jens
    Bartels, Claudia
    Kilimann, Ingo
    Laske, Christoph
    Munk, Matthias H.
    Roeske, Sandra
    Frommann, Ingo
    Hoffmann, Daniel C.
    Jessen, Frank
    Wagner, Michael
    Linz, Nicklas
    Teipel, Stefan
    ALZHEIMER'S & DEMENTIA: DIAGNOSIS, ASSESSMENT & DISEASE MONITORING, 2024, 16 (04)
  • [45] Speech-Based Annotation and Retrieval of Digital Photographs
    Hazen, Timothy J.
    Sherry, Brennan
    Adler, Mark
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2077 - +
  • [46] An Exploration of Speech-Based Productivity Support in the Car
    Martelaro, Nikolas
    Teevan, Jaime
    Iqbal, Shamsi T.
    CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [47] Speech-based Interaction: Myths, Challenges, and Opportunities
    Munteanu, Cosmin
    Penn, Gerald
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION WITH MOBILE DEVICES AND SERVICES (MOBILEHCI'14), 2014, : 567 - 568
  • [48] Investigating cognitive workload in irrelevant speech-based information communication with visual distractions: Pleasant or distracted?
    Liu, Li
    Duffy, Vincent G.
    INTERNATIONAL JOURNAL OF INDUSTRIAL ERGONOMICS, 2024, 99
  • [49] The SRI Speech-Based Collaborative Learning Corpus
    Richey, Colleen
    D'Angelo, Cynthia
    Alozie, Nonye
    Bratt, Harry
    Shriberg, Elizabeth
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1550 - 1554
  • [50] Speech-Based Interface For Visually Impaired Users
    Huang, Yi-Chin
    Tsai, Cheng-Hung
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 1223 - 1228