Cognitive state classification in a spoken tutorial dialogue system

被引:7
|
作者
Zhang, Tong [1 ]
Hasegawa-Johnson, Mark [1 ]
Levinson, Stephen E. [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
intelligent tutoring system; user affect recognition; spoken language processing;
D O I
10.1016/j.specom.2005.09.006
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the manual and automatic labelling, from spontaneous speech, of a particular type of user affect that we call the cognitive state in a tutorial dialogue system with students of primary and early middle school ages. Our definition of the cognitive state is based on analysis of children's spontaneous speech, which is acquired during Wizard-of-Oz simulations of an intelligent math and physics tutor. The cognitive states of children are categorized into three classes: confidence, puzzlement, and hesitation. The manual labelling of cognitive states had an inter-transcriber agreement of kappa score 0.93. The automatic cognitive state labels are generated by classifying prosodic features, text features, and spectral features. Text features are generated from an automatic speech recognition (ASR) system; features include indicator functions of keyword classes and part-of-speech sequences. Spectral features are created based on acoustic likelihood scores of a cognitive state-dependent ASR system, in which phoneme models are adapted to utterances labelled for a particular cognitive state. The effectiveness of the proposed method has been tested on both manually and automatically transcribed speech, and the test yielded very high correctness: 96.6% for manually transcribed speech and 95.7% for automatically recognized speech. Our study shows that the proposed spectral features greatly outperformed the other types of features in the cognitive state classification experiments. Our study also shows that the spectral and prosodic features derived directly from speech signals were very robust to speech recognition errors, much more than the lexical and part-of-speech based features. (C) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:616 / 632
页数:17
相关论文
共 50 条
  • [1] A tutorial dialogue system with unrestricted spoken input
    Bell, Peter
    Dzikovska, Myroslava
    Isard, Amy
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2111 - 2112
  • [2] Dialogue act classification in a spoken dialogue system
    Castro, MJ
    Vilar, D
    Aibar, P
    Sanchis, E
    CURRENT TOPICS IN ARTIFICIAL INTELLIGENCE, 2004, 3040 : 260 - 270
  • [3] Designing a spoken language interface for a tutorial dialogue system
    Bell, Peter
    Dzikovska, Myroslava
    Isard, Amy
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1282 - 1285
  • [4] THE LUNA SPOKEN DIALOGUE SYSTEM: BEYOND UTTERANCE CLASSIFICATION
    Dinarelli, M.
    Stepanov, E. A.
    Varges, S.
    Riccardi, G.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5366 - 5369
  • [5] Using Information State to Improve Dialogue Move Identification in a Spoken Dialogue System
    Ai, Hua
    Roque, Antonio
    Leuski, Anton
    Traum, David
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2596 - +
  • [6] Affective-Cognitive Dialogue Act Detection in an Error-Aware Spoken Dialogue System
    Liang, Wei-Bin
    Wu, Chung-Hsien
    Sheng, Meng-Hsiu
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [7] Balancing cognitive and motivational scaffolding in tutorial dialogue
    Boyer, Kristy Elizabeth
    Phillips, Robert
    Wallis, Michael
    Vouk, Mladen
    Lester, James
    INTELLIGENT TUTORING SYSTEM, PROCEEDINGS, 2008, 5091 : 239 - 249
  • [8] Target-based state and tracking algorithm for spoken dialogue system
    Li, Miao
    He, Zhiyang
    Wu, Ji
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2711 - 2715
  • [9] Utterance Intent Classification of a Spoken Dialogue System with Efficiently Untied Recursive Autoencoders
    Kato, Tsuneo
    Nagai, Atsushi
    Noda, Naoki
    Sumitomo, Ryosuke
    Wu, Jianming
    Yamamoto, Seiichi
    18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017), 2017, : 60 - 64
  • [10] Utterance intent classification of a spoken dialogue system with efficiently untied recursive autoencoders
    Kato, Tsuneo
    Nagai, Atsushi
    Noda, Naoki
    Sumitomo, Ryosuke
    Wu, Jianming
    Yamamoto, Seiichi
    SIGDIAL 2017 - 18th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference, 2017, : 60 - 64