Usefulness, localizability, humanness, and language-benefit: additional evaluation criteria for natural language dialogue systems

被引:18
作者
AbuShawar, Bayan [1 ]
Atwell, Eric [2 ]
机构
[1] Arab Open Univ, IT Dept, POB 1339, Amman 11953, Jordan
[2] Univ Leeds, Sch Comp, Leeds LS2 9JT, W Yorkshire, England
关键词
Chatbot; Usefulness; Localizability; Humanness; Naturalness; Language benefit;
D O I
10.1007/s10772-015-9330-4
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Human-computer dialogue systems interact with human users using natural language. We used the ALICE/AIML chatbot architecture as a platform to develop a range of chatbots covering different languages, genres, text-types, and user-groups, to illustrate qualitative aspects of natural language dialogue system evaluation. We present some of the different evaluation techniques used in natural language dialogue systems, including black box and glass box, comparative, quantitative, and qualitative evaluation. Four aspects of NLP dialogue system evaluation are often overlooked: "usefulness'' in terms of a user's qualitative needs, "localizability'' to new genres and languages, "humanness'' or "naturalness'' compared to human-human dialogues, and "language benefit'' compared to alternative interfaces. We illustrated these aspects with respect to our work on machine-learnt chatbot dialogue systems; we believe these aspects are worthwhile in impressing potential new users and customers.
引用
收藏
页码:373 / 383
页数:11
相关论文
共 39 条
[1]  
Abu Shawar B., 2010, P 6 IASTED INT C ADV, P183, DOI [10.2316/P.2010.689-050, DOI 10.2316/P.2010.689-050]
[2]  
Abu Shawar B., 2008, P INFOS2008
[3]   A Chatbot as a Natural Web Interface to Arabic Web QA [J].
Abu Shawar, Bayan .
INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2011, 6 (01) :37-43
[4]   THE PHILIPS AUTOMATIC TRAIN TIMETABLE INFORMATION-SYSTEM [J].
AUST, H ;
OERDER, M ;
SEIDE, F ;
STEINBISS, V .
SPEECH COMMUNICATION, 1995, 17 (3-4) :249-262
[5]  
Bamberger Craig S., 1996, ENERGY CHARTER TREAT, V1, P1
[6]   The role of a natural language conversational interface in online sales: A case study [J].
Chai J. ;
Lin J. ;
Zadrozny W. ;
Ye Y. ;
Stys-Budzikowska M. ;
Horvath V. ;
Kambhatla N. ;
Wolf C. .
International Journal of Speech Technology, 2001, 4 (3-4) :285-295
[7]  
Chai Joyce Yue, 2001, P 13 C INNOVATIVE AP, P19
[8]  
Colby K.M., 1973, COMPUTER MODELS THOU, P251
[9]  
Crockett K., 2009, P 6 IJCAI WORKSH KNO
[10]  
Cunningham H., 1999, Natural Language Engineering, V5, P1, DOI 10.1017/S1351324999002144