Human-computer multimodal interface to internet navigation

被引:6
|
作者
Mosquera-DeLaCruz, Jose H. [1 ]
Loaiza-Correa, Humberto [1 ]
Nope-Rodriguez, Sandra E. [1 ]
Restrepo-Giro, Andres D. [1 ]
机构
[1] Univ Valle, Sch Elect & Elect Engn, Cali, Colombia
关键词
Human-computer interaction; speech recognition; computer vision; internet vavigation; SPEECH; PEOPLE;
D O I
10.1080/17483107.2020.1799440
中图分类号
R49 [康复医学];
学科分类号
100215 ;
摘要
Purpose: Propose and develop new multimodal interfaces that allow people with motor impairments to control mass use applications in a natural way through gestures and voice. Methods: A multimodal interaction interface was developed for using Google Chrome, Gmail and Facebook applications through gestural and verbal commands. The interface activates mouse and keyboard commands from the processing of voice signals and videos of the user's head movement. The interface does not disable traditional keyboard and mouse functions; moreover, it only requires a webcam and a microphone, which are usually built into portable computers. Results: The tests were performed on three groups of people: young adults, older adults and people with motor impairments. The verbal interaction was tested on a total of 189 voice commands with an average performance of 75.7% and a total of 105 dictations. The dictations had an average of 13 words; the system had a performance of 81.1%. Moreover, the gestural interaction activated 126 commands without errors using a drop-down menu; a click was activated 84 times with a success rate of 70.2%. Conclusion: The motor impairments group especially valued the option of using Google Chrome, Gmail and Facebook without physically manipulating a mouse and keyboard. This group showed a greater preference for verbal control than for gestural control. An adaptation period is required for the adults group to acquire greater skill in using the interface. The young adults group preferred the types of interactions they are accustomed to due to their familiarity with Information and Communication Technologies (ICT); they considered the interface fun. Implication for rehabilitation Computers and their applications were conceived with unnatural interaction mechanisms, such as the keyboard and mouse, which prevent their use by people with psychophysiological limitations or digital literacy. For this reason, the need arises to design new natural interfaces commanded by gestures and voice. It is necessary to develop low-cost interfaces that can control mass-use applications such as Google Chrome, Facebook and Gmail, which do not require additional hardware using webcams and microphones, which are usually integrated into laptops. The development of these multimodal interfaces improves the quality of life of people with motor impairments, allowing them to have access to Information and Communication Technologies (ICT).
引用
收藏
页码:807 / 820
页数:14
相关论文
共 50 条
  • [21] Gaze tracking for multimodal human-computer interaction
    Stiefelhagen, R
    Yang, J
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 2617 - 2620
  • [22] New Applications of Multimodal Human-Computer Interfaces
    Czyzewski, Andrzej
    2012 JOINT CONFERENCE NEW TRENDS IN AUDIO & VIDEO AND SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, & APPLICATIONS (NTAV-SPA 2012), 2012, : 19 - 24
  • [23] Displays as Advanced Human-Computer Interface
    Nakatani, Yoshio
    IDW'11: PROCEEDINGS OF THE 18TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2011, : 437 - 440
  • [24] Human-Computer Interface Controlled by the Lip
    Jose, Marcelo Archajo
    Lopes, Roseli de Deus
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2015, 19 (01) : 302 - 308
  • [25] INNOVATIVE INTERFACE FOR HUMAN-COMPUTER INTERACTION
    Rolshofen, W.
    Dietz, P.
    Schaefer, G.
    9TH INTERNATIONAL DESIGN CONFERENCE - DESIGN 2006, VOLS 1 AND 2, 2006, (36): : 611 - +
  • [26] SPOKEN MULTIMODAL HUMAN-COMPUTER DIALOGUE Preface
    Minker, Wolfgang
    Buehler, Dirk
    Dybkjaer, Laila
    SPOKEN MULTIMODAL HUMAN-COMPUTER DIALOGUE IN MOBILE ENVIRONMENTS, 2005, 28 : XI - XII
  • [27] THE HUMAN-COMPUTER INTERFACE - WHAT NEXT
    CLARKE, DJ
    ELECTRONICS AND POWER, 1986, 32 (06): : 440 - 440
  • [28] ENGINEERING THE HUMAN-COMPUTER INTERFACE - DOWNTON,A
    WILSON, J
    APPLIED ERGONOMICS, 1992, 23 (05) : 361 - 361
  • [29] Human-computer interface for collaborative argumentation
    Wen, LR
    Duh, CM
    INTERNATIONAL SYMPOSIUM ON MULTIMEDIA SOFTWARE ENGINEERING, PROCEEDINGS, 2000, : 352 - 355
  • [30] Speech recognition in the human-computer interface
    Rebman, CM
    Aiken, MW
    Cegielski, CG
    INFORMATION & MANAGEMENT, 2003, 40 (06) : 509 - 519