The Chatbot Usability Scale: the Design and Pilot of a Usability Scale for Interaction with AI-Based Conversational Agents

被引:0
|
作者
Borsci, Simone [1 ,2 ]
Malizia, Alessio [3 ,4 ]
Schmettow, Martin [1 ]
van der Velde, Frank [1 ]
Tariverdiyeva, Gunay [5 ]
Balaji, Divyaa [6 ]
Chamberlain, Alan [7 ]
机构
[1] Department of Learning, Data analysis, and Technology, Cognition, Data and Education (CODE) group, Faculty of Behavioural Management and Social sciences, University of Twente, Enschede, Netherlands
[2] NIHR London In-Vitro Diagnostics Cooperative, Imperial College of London, London, United Kingdom
[3] Computer Science Department, University of Pisa, Pisa, Italy
[4] Molde University College, Molde, Norway
[5] Backbase, Amsterdam, Netherlands
[6] Faculty of Social and Behavioural Sciences, University of Amsterdam, Amsterdam, Netherlands
[7] School of Computer Science, University of Nottingham, Nottingham, United Kingdom
基金
英国工程与自然科学研究理事会; 英国科研创新办公室;
关键词
Buses - Artificial intelligence - Autonomous agents - Surveys - Usability engineering;
D O I
暂无
中图分类号
学科分类号
摘要
Standardised tools to assess a user’s satisfaction with the experience of using chatbots and conversational agents are currently unavailable. This work describes four studies, including a systematic literature review, with an overall sample of 141 participants in the survey (experts and novices), focus group sessions and testing of chatbots to (i) define attributes to assess the quality of interaction with chatbots and (ii) the designing and piloting a new scale to measure satisfaction after the experience with chatbots. Two instruments were developed: (i) A diagnostic tool in the form of a checklist (BOT-Check). This tool is a development of previous works which can be used reliably to check the quality of a chatbots experience in line with commonplace principles. (ii) A 15-item questionnaire (BOT Usability Scale, BUS-15) with estimated reliability between.76 and.87 distributed in five factors. BUS-15 strongly correlates with UMUX-LITE by enabling designers to consider a broader range of aspects usually not considered in satisfaction tools for non-conversational agents, e.g. conversational efficiency and accessibility, quality of the chatbot’s functionality and so on. Despite the convincing psychometric properties, BUS-15 requires further testing and validation. Designers can use it as a tool to assess products, thus building independent databases for future evaluation of its reliability, validity and sensitivity. © 2021, The Author(s).
引用
收藏
页码:95 / 119
相关论文
共 50 条
  • [41] Shaping and evaluating a system for affective computing in online higher education using a participatory design and the system usability scale
    Shingjergji, Krist
    Urlings, Corrie
    Iren, Deniz
    Klemke, Roland
    FOURTEENTH INTERNATIONAL CONFERENCE ON LEARNING ANALYTICS & KNOWLEDGE, LAK 2024, 2024, : 382 - 391
  • [42] Designing Virtual Reality-Based Conversational Agents to Train Clinicians in Verbal De-escalation Skills: Exploratory Usability Study
    Moore, Nathan
    Ahmadpour, Naseem
    Brown, Martin
    Poronnik, Philip
    Davids, Jennifer
    JMIR SERIOUS GAMES, 2022, 10 (03):
  • [43] Students' attitudes towards an AI-based interview for university admissions: Scale development and intervention effects
    Shin, Namin
    Choi, Eunyoung
    Chang, Sejin
    EDUCATION AND INFORMATION TECHNOLOGIES, 2024, 29 (15) : 20055 - 20076
  • [44] Using Data Augmentation to Support AI-Based Requirements Evaluation in Large-Scale Projects
    Korfmann, Robin
    Beyersdorffer, Patrick
    Muench, Juergen
    Kuhrmann, Marco
    SYSTEMS, SOFTWARE AND SERVICES PROCESS IMPROVEMENT, EUROSPI 2024, PT I, 2024, 2179 : 97 - 111
  • [45] Augmented dialectal speech recognition for AI-based neuropsychological scale assessment in Alzheimer's disease
    Zhang, Meiwei
    Cui, Qiushi
    Li, Wenyuan
    Yu, Weihua
    Chen, Lihua
    Li, Wenjie
    Zhu, Chenzhe
    Lu, Yang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99
  • [46] AI-Based Nano-Scale Material Property Prediction for Li-Ion Batteries
    Lal, Mohit Anil
    Singh, Akashdeep
    Mzik, Ryan
    Lanjan, Amirmasoud
    Srinivasan, Seshasai
    BATTERIES-BASEL, 2024, 10 (02):
  • [47] Introducing CASUX: A Standardized Scale for Measuring the User Experience of Artificial Intelligence Based Conversational Agents
    Faruk, Lawal Ibrahim Dutsinma
    Pal, Debajyoti
    Funilkul, Suree
    Perumal, Thinagaran
    Mongkolnam, Pornchai
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2024,
  • [48] Multimodal Usability of Human-Computer Interaction Based on Task-Human-Computer-Centered Design
    Shi, Yanbin
    Li, Xiaoqi
    Ouyang, Dantong
    Jiang, Hongtao
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT I, 2017, 10361 : 459 - 468
  • [49] MECP2 duplication syndrome: phenotypic characterization, AI-based diagnosis, and severity scale development
    Martinez-Monseny, Antonio Federico
    Vega Hanna, Lourdes
    Casas-Alba, Didac
    Palau Martinez, Francesc
    Bolasell, Merce
    Rubio, Patricia
    O'callaghan, Mar
    Pascual-Alonso, Ainhoa
    Balsells, Sol
    Armstrong, Judith
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1068 - 1068
  • [50] AI-Based Evaluation of Homogeneous Flow Volume Fractions Independent of Scale Using Capacitance and Photon Sensors
    Mayet, Abdulilah M.
    Mohammed, Salman A.
    Qamar, Shamimul
    Loukil, Hassen
    Shukla, Neeraj K.
    ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY, 2024, 12 (02): : 167 - 178