WeldVUI: Establishing Speech-Based Interfaces in Industrial Applications

被引：0

作者：

Augstein, Mirjam ^{[1
]}

Neumayr, Thomas ^{[1
]}

Pimminger, Sebastian ^{[1
]}

机构：

[1] Univ Appl Sci Upper Austria, Hagenberg, Austria

来源：

HUMAN-COMPUTER INTERACTION, INTERACT 2019, PT III | 2019年 / 11748卷

关键词：

Voice user interface design; User-centered design; Interaction design; Speech-based interfaces; Industrial applications; VOICE;

D O I：

10.1007/978-3-030-29387-1_40

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Voice User Interfaces (VUIs) and speech-based applications have recently gained increasing popularity. During the past years, they have been included in a wide range of mass-market devices (smart phones or technology installed in common car cockpits) and are thus available for many everyday interaction scenarios (e.g., making phone calls or switching the lights on and off). This popularity also led to a number of guidelines for VUI design, software libraries and devices for speech recognition becoming available for interface designers and developers. Although generally helpful, these resources are often broad and do not fully satisfy the specific requirements of certain industrial applications. First, grammar and vocabulary in such settings usually differ drastically from everyday scenarios. Second, common software libraries and devices are often not able to comply with the conditions in industrial environments (e.g. involving high levels of noise). This paper describes the iterative, user-centered design process for VUIs and functional speech-based interaction prototypes for the domain of industrial welding, including a two-stage Wizard of Oz procedure, rapid prototyping, speech recognition improvement and thorough user involvement. Our experiences throughout this process generalize to other industrial applications and so-called "niche applications" where grammar and vocabulary usually have to be established from scratch. They are intended to guide other researchers setting up a similar process for designing and prototyping domain-specific VUIs.

引用

页码：679 / 698

页数：20

共 50 条

[31] Noise-Robust Multimodal Audio-Visual Speech Recognition System for Speech-Based Interaction Applications
Jeon, Sanghun
Kim, Mun Sang
SENSORS, 2022, 22 (20)
[32] Speaker normalisation for speech-based emotion detection
Sethu, Vidhyasaharan
Ambikairajah, Eliathainby
Epps, Julien
PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 611 - +
[33] VOICE: a framework for speech-based mobile systems
Sharp, Adam
Kurkovsky, Stan
21ST INTERNATIONAL CONFERENCE ON ADVANCED NETWORKING AND APPLICATIONS WORKSHOPS/SYMPOSIA, VOL 2, PROCEEDINGS, 2007, : 38 - +
[34] Speech-Based Annotation and Retrieval of Digital Photographs
Hazen, Timothy J.
Sherry, Brennan
Adler, Mark
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2077 - +
[35] An Exploration of Speech-Based Productivity Support in the Car
Martelaro, Nikolas
Teevan, Jaime
Iqbal, Shamsi T.
CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
[36] Speech-based Interaction: Myths, Challenges, and Opportunities
Munteanu, Cosmin
Penn, Gerald
PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION WITH MOBILE DEVICES AND SERVICES (MOBILEHCI'14), 2014, : 567 - 568
[37] The SRI Speech-Based Collaborative Learning Corpus
Richey, Colleen
D'Angelo, Cynthia
Alozie, Nonye
Bratt, Harry
Shriberg, Elizabeth
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1550 - 1554
[38] Speech-Based Interface For Visually Impaired Users
Huang, Yi-Chin
Tsai, Cheng-Hung
IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 1223 - 1228
[39] Speech-Based Automated Cognitive Status Assessment
Hakkani-Tuer, Dilek
Vergyri, Dimitra
Tur, Gokhan
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 258 - +
[40] Contemporary Reflections on Speech-Based Language Learning
Gustafson, Marianne
VOLTA REVIEW, 2009, 109 (2-3) : 143 - 153

← 1 2 3 4 5 →