Cognitive state classification in a spoken tutorial dialogue system

被引:7
|
作者
Zhang, Tong [1 ]
Hasegawa-Johnson, Mark [1 ]
Levinson, Stephen E. [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
intelligent tutoring system; user affect recognition; spoken language processing;
D O I
10.1016/j.specom.2005.09.006
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the manual and automatic labelling, from spontaneous speech, of a particular type of user affect that we call the cognitive state in a tutorial dialogue system with students of primary and early middle school ages. Our definition of the cognitive state is based on analysis of children's spontaneous speech, which is acquired during Wizard-of-Oz simulations of an intelligent math and physics tutor. The cognitive states of children are categorized into three classes: confidence, puzzlement, and hesitation. The manual labelling of cognitive states had an inter-transcriber agreement of kappa score 0.93. The automatic cognitive state labels are generated by classifying prosodic features, text features, and spectral features. Text features are generated from an automatic speech recognition (ASR) system; features include indicator functions of keyword classes and part-of-speech sequences. Spectral features are created based on acoustic likelihood scores of a cognitive state-dependent ASR system, in which phoneme models are adapted to utterances labelled for a particular cognitive state. The effectiveness of the proposed method has been tested on both manually and automatically transcribed speech, and the test yielded very high correctness: 96.6% for manually transcribed speech and 95.7% for automatically recognized speech. Our study shows that the proposed spectral features greatly outperformed the other types of features in the cognitive state classification experiments. Our study also shows that the spectral and prosodic features derived directly from speech signals were very robust to speech recognition errors, much more than the lexical and part-of-speech based features. (C) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:616 / 632
页数:17
相关论文
共 50 条
  • [41] A Spoken Dialogue System for the EMPATHIC Virtual Coach
    Ines Torres, M.
    Mikel Olaso, Javier
    Glackin, Neil
    Justo, Raquel
    Chollet, Gerard
    9TH INTERNATIONAL WORKSHOP ON SPOKEN DIALOGUE SYSTEM TECHNOLOGY, 2019, 579 : 259 - 265
  • [42] Designing and evaluating an adaptive spoken dialogue system
    Litman, DJ
    Pan, SM
    USER MODELING AND USER-ADAPTED INTERACTION, 2002, 12 (2-3) : 111 - 137
  • [43] Empirically evaluating an adaptable spoken dialogue system
    Litman, DJ
    Pan, SM
    UM99: USER MODELING, PROCEEDINGS, 1999, (407): : 55 - 64
  • [44] Adaptivity and response generation in a spoken dialogue system
    Jokinen, K
    Wilcock, G
    CURRENT AND NEW DIRECTIONS IN DISCOURSE AND DIALOGUE, 2003, 22 : 213 - 234
  • [45] Spoken dialogue system DUG-1
    Dohsaka, K
    Nakano, M
    Miyazaki, N
    Hirasawa, J
    Aikawa, K
    NTT REVIEW, 2000, 12 (04): : 62 - 64
  • [46] A Spoken Dialogue System Based on FST and DBN
    Fan, Lichun
    Yu, Dong
    Peng, Xingyuan
    Lu, Shixiang
    Xu, Bo
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, 2012, 333 : 34 - +
  • [47] Spoken Language Understanding for a Nutrition Dialogue System
    Korpusik, Mandy
    Glass, James
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (07) : 1450 - 1461
  • [48] Spoken Interaction within the ComputedWorld: Evaluation of a Multitasking Adaptive Spoken Dialogue System
    Heinroth, Tobias
    Denich, Dan
    2011 35TH IEEE ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2011, : 134 - 143
  • [49] EVALUATION OF A SPOKEN DIALOGUE SYSTEM FOR CONTROLLING A HIFI AUDIO SYSTEM
    Martinez, F. Fernandez
    Blazquez, J.
    Ferreiros, J.
    Barra, R.
    Macias-Guarasa, J.
    Lucas-Cuesta, J. M.
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 137 - +
  • [50] Adaptive Intelligent Tutorial Dialogue in the BEETLE II System
    Dzikovska, Myroslava O.
    Isard, Amy
    Bell, Peter
    Moore, Johanna D.
    Steinhauser, Natalie B.
    Campbe, Gwendolyn E.
    Taylor, Leanne S.
    Caine, Simon
    Scott, Charlie
    ARTIFICIAL INTELLIGENCE IN EDUCATION, 2011, 6738 : 621 - 621