Speech recognition for command entry in multimodal interaction

被引:3
|
作者
Tyfa, DA [1 ]
Howes, M [1 ]
机构
[1] Univ Leeds, Sch Psychol, Leeds LS2 9JT, W Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
speech recognition; multiple resources; multimodal interaction; command entry; hands-busy; eyes-busy; verbal interference;
D O I
10.1006/ijhc.1999.0355
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Two experiments investigated the cognitive efficiency of using speech recognition in combination with the mouse and keyboard for a range of word processing tasks. The first experiment examined the potential of this multimodal combination to increase performance by engaging concurrent multiple resources. Speech and mouse responses were compared when using menu and direct (toolbar icon) commands, making for a fairer comparison than in previous research which has been biased against the mouse. Only a limited basis for concurrent resource use was found, with speech leading to poorer task performance with both command types. Task completion times were faster with direct commands for both speech and mouse responses, and direct commands were preferred. In the second experiment, participants were free to choose command type, and nearly always chose to use direct commands with both response modes. Speech performance was again worse than mouse, except for tasks which involved a large amount of hand and eye movement, or where direct speech was used but mouse commands were made via menus. In both experiments recognition errors were low, and although they had some detrimental effect on speech use, problems in combining speech and manual modes were highlighted. Potential verbal interference effects when using speech are discussed. (C) 2000 Academic Press.
引用
收藏
页码:637 / 667
页数:31
相关论文
共 50 条
  • [1] Multimodal systems for speech recognition
    Mamyrbayev, Orken Zh
    Alimhan, Keylan
    Amirgaliyev, Beibut
    Zhumazhanov, Bagashar
    Mussayeva, Dinara
    Gusmanova, Farida
    INTERNATIONAL JOURNAL OF MOBILE COMMUNICATIONS, 2020, 18 (03) : 314 - 326
  • [2] Multimodal recognition of speech and electrocorticogram
    Ahuja, Mitali
    Komeiji, Shuji
    Mitsuhashi, Takumi
    Iimura, Yasushi
    Suzuki, Hiroharu
    Sugano, Hidenori
    Shinoda, Koichi
    Tanaka, Toshihisa
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 546 - 550
  • [3] Automatic Speech Recognition Technique For Voice Command
    Gupta, Anshul
    Patel, Nileshkumar
    Khan, Shabana
    2014 INTERNATIONAL CONFERENCE ON SCIENCE ENGINEERING AND MANAGEMENT RESEARCH (ICSEMR), 2014,
  • [4] Listening to the World Improves Speech Command Recognition
    McMahan, Brian
    Rao, Delip
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 378 - 385
  • [5] Speech Command Recognition Using Deep Learning
    Ayache, Mohammad
    Kanaan, Hussien
    Kassir, Kawthar
    Kassir, Yasser
    2021 SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN BIOMEDICAL ENGINEERING (ICABME), 2021, : 24 - 29
  • [6] Application of Bernstein and pattern recognition methods for speech command recognition
    Department of Computer Education, Gazi University, 06500 Ankara, Turkey
    J. Appl. Sci., 2007, 20 (3063-3068):
  • [7] Speech and Gesture Command Recognition to Improve Human-Robot Interaction in Manual Assembly Lines
    Vento, Mario
    Greco, Antonio
    Carletti, Vincenzo
    ERCIM NEWS, 2023, (132): : 12 - 13
  • [8] Multimodal Speech Recognition with Ultrasonic Sensors
    Zhu, Bo
    Hazen, Timothy J.
    Glass, James R.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2328 - 2331
  • [9] Multiresolution and Multimodal Speech Recognition with Transformers
    Paraskevopoulos, Georgios
    Parthasarathy, Srinivas
    Khare, Aparna
    Sundaram, Shiva
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2381 - 2387
  • [10] Incremental speech recognition for multimodal interfaces
    Fink, GA
    Schillo, C
    Kummert, F
    Sagerer, G
    IECON '98 - PROCEEDINGS OF THE 24TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-4, 1998, : 2012 - 2017