Speech-based cognitive load monitoring system

Cited by: 55

Authors:

Yin, Bo [1]

Chen, Fang [1]

Ruiz, Natalie [1]

Ambikairajah, Eliathamby [1]

Affiliations:

[1] NICTA, Eveleigh 1430, Australia

Source:

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008

Keywords:

cognitive load; speech classification

DOI:

10.1109/ICASSP.2008.4518041

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Monitoring cognitive load is important for preventing errors in task-critical operations and for developing adaptive user interfaces that maintain productivity and efficiency in work performance. Speech, as an objective and non-intrusive measure, is well suited to monitoring cognitive load. Existing approaches are limited to speaker-dependent recognition and need manually labeled data. We propose a novel automatic, speaker-independent classification approach that monitors a person's cognitive load level in real time using speech features. In this approach, a Gaussian Mixture Model (GMM) based classifier is created with unsupervised training. Channel and speaker normalization are deployed to improve robustness. Different delta techniques are investigated for capturing temporal information, and a background model is introduced to reduce the impact of insufficient training data. The final system achieves 71.1% and 77.5% accuracy on two different tasks, each of which has three discrete cognitive load levels. This performance shows great potential for real-world applications.
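The abstract mentions normalization and delta features without saying which variants the authors used. As a rough illustration only, the sketch below assumes per-utterance mean subtraction as the normalization and the common regression-based delta formula as the temporal feature. Both choices, and the helper names `mean_normalize`, `delta`, and `add_deltas`, are assumptions made for this example and are not taken from the paper.

```python
from typing import List

Frame = List[float]  # one feature vector per speech frame


def mean_normalize(frames: List[Frame]) -> List[Frame]:
    """Subtract the per-utterance mean from every feature dimension.

    Assumed stand-in for the channel/speaker normalization in the abstract.
    """
    n = len(frames)
    dims = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dims)]
    return [[f[d] - means[d] for d in range(dims)] for f in frames]


def delta(frames: List[Frame], width: int = 2) -> List[Frame]:
    """Regression-based delta over +/- `width` frames.

    Edge frames are repeated as padding. This is one common delta
    definition, assumed here; the paper compares several techniques.
    """
    n = len(frames)
    dims = len(frames[0])
    denom = 2 * sum(k * k for k in range(1, width + 1))
    out = []
    for t in range(n):
        row = []
        for d in range(dims):
            acc = 0.0
            for k in range(1, width + 1):
                nxt = frames[min(t + k, n - 1)][d]
                prv = frames[max(t - k, 0)][d]
                acc += k * (nxt - prv)
            row.append(acc / denom)
        out.append(row)
    return out


def add_deltas(frames: List[Frame]) -> List[Frame]:
    """Normalize, then append delta coefficients to each frame."""
    normed = mean_normalize(frames)
    return [f + d for f, d in zip(normed, delta(normed))]
```

In a pipeline like the one the abstract describes, vectors of this kind would be the input to the GMM classifier. The classifier itself is not sketched here.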

Citation:

Pages: 2041-2044

Number of pages: 4
