Speech recognition for command entry in multimodal interaction

被引:3
|
作者
Tyfa, DA [1 ]
Howes, M [1 ]
机构
[1] Univ Leeds, Sch Psychol, Leeds LS2 9JT, W Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
speech recognition; multiple resources; multimodal interaction; command entry; hands-busy; eyes-busy; verbal interference;
D O I
10.1006/ijhc.1999.0355
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Two experiments investigated the cognitive efficiency of using speech recognition in combination with the mouse and keyboard for a range of word processing tasks. The first experiment examined the potential of this multimodal combination to increase performance by engaging concurrent multiple resources. Speech and mouse responses were compared when using menu and direct (toolbar icon) commands, making for a fairer comparison than in previous research which has been biased against the mouse. Only a limited basis for concurrent resource use was found, with speech leading to poorer task performance with both command types. Task completion times were faster with direct commands for both speech and mouse responses, and direct commands were preferred. In the second experiment, participants were free to choose command type, and nearly always chose to use direct commands with both response modes. Speech performance was again worse than mouse, except for tasks which involved a large amount of hand and eye movement, or where direct speech was used but mouse commands were made via menus. In both experiments recognition errors were low, and although they had some detrimental effect on speech use, problems in combining speech and manual modes were highlighted. Potential verbal interference effects when using speech are discussed. (C) 2000 Academic Press.
引用
收藏
页码:637 / 667
页数:31
相关论文
共 50 条
  • [21] Speech Command Classification System for Sinhala Language based on Automatic Speech Recognition
    Dinushika, Thilini
    Kavmini, Lakshika
    Abeyawardhana, Pamoda
    Thayasivam, Uthayasanker
    Jayasena, Sanath
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 205 - 210
  • [22] AVAS: Speech Database for Multimodal Recognition Applications
    Antar, Samar
    Sagheer, Alaa
    Aly, Saleh
    Tolba, Mohamed F.
    2013 13TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2013, : 123 - 128
  • [23] Multimodal speech recognition for unmanned aerial vehicles
    Oneață, Dan
    Cucu, Horia
    Computers and Electrical Engineering, 2021, 90
  • [24] Noise-Robust Multimodal Audio-Visual Speech Recognition System for Speech-Based Interaction Applications
    Jeon, Sanghun
    Kim, Mun Sang
    SENSORS, 2022, 22 (20)
  • [25] Multimodal English corpus for automatic speech recognition
    Kunka, Bartosz
    Kupryjanow, Adam
    Dalka, Piotr
    Bratoszewski, Piotr
    Szczodrak, Maciej
    Spaleniak, Pawel
    Szykulski, Marcin
    Czyzewski, Andrzej
    2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013, : 106 - 111
  • [26] Speech Command Recognition: Text-to-Speech and Speech Corpus Scraping Are All You Need
    Kuzdeuov, Askat
    Nurgaliyev, Shakhizat
    Turmakhan, Diana
    Laiyk, Nurkhan
    Varol, Huseyin Atakan
    2023 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND ARTIFICIAL INTELLIGENCE, RAAI 2023, 2023, : 286 - 291
  • [27] Towards the explainability of Multimodal Speech Emotion Recognition
    Kumar, Puneet
    Kaushik, Vishesh
    Raman, Balasubramanian
    INTERSPEECH 2021, 2021, : 1748 - 1752
  • [28] Temporal Multimodal Learning in Audiovisual Speech Recognition
    Hu, Di
    Li, Xuelong
    Lu, Xiaoqiang
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3574 - 3582
  • [29] CONTINUOUS VISUAL SPEECH RECOGNITION FOR MULTIMODAL FUSION
    Benhaim, Eric
    Sahbi, Hichem
    Vitte, Guillaume
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [30] Multimodal speech recognition for unmanned aerial vehicles
    Oneata, Dan
    Cucu, Horia
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 90