WeldVUI: Establishing Speech-Based Interfaces in Industrial Applications

Times cited: 0
Authors
Augstein, Mirjam [1]
Neumayr, Thomas [1]
Pimminger, Sebastian [1]
Affiliations
[1] Univ Appl Sci Upper Austria, Hagenberg, Austria
Keywords
Voice user interface design; User-centered design; Interaction design; Speech-based interfaces; Industrial applications; VOICE;
DOI
10.1007/978-3-030-29387-1_40
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Voice User Interfaces (VUIs) and speech-based applications have recently gained increasing popularity. During the past years, they have been included in a wide range of mass-market devices (e.g., smartphones or technology installed in common car cockpits) and are thus available for many everyday interaction scenarios (e.g., making phone calls or switching the lights on and off). This popularity also led to a number of guidelines for VUI design, software libraries and devices for speech recognition becoming available for interface designers and developers. Although generally helpful, these resources are often broad and do not fully satisfy the specific requirements of certain industrial applications. First, grammar and vocabulary in such settings usually differ drastically from everyday scenarios. Second, common software libraries and devices are often not able to comply with the conditions in industrial environments (e.g., involving high levels of noise). This paper describes the iterative, user-centered design process for VUIs and functional speech-based interaction prototypes for the domain of industrial welding, including a two-stage Wizard of Oz procedure, rapid prototyping, speech recognition improvement and thorough user involvement. Our experiences throughout this process generalize to other industrial applications and so-called "niche applications" where grammar and vocabulary usually have to be established from scratch. They are intended to guide other researchers setting up a similar process for designing and prototyping domain-specific VUIs.
Pages: 679-698
Page count: 20
Related papers
50 in total
  • [31] Noise-Robust Multimodal Audio-Visual Speech Recognition System for Speech-Based Interaction Applications
    Jeon, Sanghun
    Kim, Mun Sang
    SENSORS, 2022, 22 (20)
  • [32] Speaker normalisation for speech-based emotion detection
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathainby
    Epps, Julien
    PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 611 - +
  • [33] VOICE: a framework for speech-based mobile systems
    Sharp, Adam
    Kurkovsky, Stan
    21ST INTERNATIONAL CONFERENCE ON ADVANCED NETWORKING AND APPLICATIONS WORKSHOPS/SYMPOSIA, VOL 2, PROCEEDINGS, 2007, : 38 - +
  • [34] Speech-Based Annotation and Retrieval of Digital Photographs
    Hazen, Timothy J.
    Sherry, Brennan
    Adler, Mark
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2077 - +
  • [35] An Exploration of Speech-Based Productivity Support in the Car
    Martelaro, Nikolas
    Teevan, Jaime
    Iqbal, Shamsi T.
    CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [36] Speech-based Interaction: Myths, Challenges, and Opportunities
    Munteanu, Cosmin
    Penn, Gerald
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION WITH MOBILE DEVICES AND SERVICES (MOBILEHCI'14), 2014, : 567 - 568
  • [37] The SRI Speech-Based Collaborative Learning Corpus
    Richey, Colleen
    D'Angelo, Cynthia
    Alozie, Nonye
    Bratt, Harry
    Shriberg, Elizabeth
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1550 - 1554
  • [38] Speech-Based Interface For Visually Impaired Users
    Huang, Yi-Chin
    Tsai, Cheng-Hung
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 1223 - 1228
  • [39] Speech-Based Automated Cognitive Status Assessment
    Hakkani-Tuer, Dilek
    Vergyri, Dimitra
    Tur, Gokhan
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 258 - +
  • [40] Contemporary Reflections on Speech-Based Language Learning
    Gustafson, Marianne
    VOLTA REVIEW, 2009, 109 (2-3) : 143 - 153