Human-computer multimodal interface to internet navigation

Cited by: 6
Authors
Mosquera-DeLaCruz, Jose H. [1]
Loaiza-Correa, Humberto [1]
Nope-Rodriguez, Sandra E. [1]
Restrepo-Giro, Andres D. [1]
Affiliation
[1] Univ Valle, Sch Elect & Elect Engn, Cali, Colombia
Keywords
Human-computer interaction; speech recognition; computer vision; internet navigation; SPEECH; PEOPLE
DOI
10.1080/17483107.2020.1799440
Chinese Library Classification (CLC) number
R49 [Rehabilitation medicine]
Discipline classification code
100215
Abstract
Purpose: Propose and develop new multimodal interfaces that allow people with motor impairments to control mass-use applications in a natural way through gestures and voice.

Methods: A multimodal interaction interface was developed for using Google Chrome, Gmail and Facebook through gestural and verbal commands. The interface triggers mouse and keyboard commands by processing voice signals and video of the user's head movement. It does not disable traditional keyboard and mouse functions, and it requires only a webcam and a microphone, which are usually built into portable computers.

Results: The tests were performed on three groups of people: young adults, older adults and people with motor impairments. Verbal interaction was tested on a total of 189 voice commands, with an average performance of 75.7%, and on a total of 105 dictations. The dictations averaged 13 words each, and the system achieved a performance of 81.1%. Gestural interaction activated 126 commands without errors using a drop-down menu; a click was activated 84 times with a success rate of 70.2%.

Conclusion: The motor impairments group especially valued the option of using Google Chrome, Gmail and Facebook without physically manipulating a mouse and keyboard. This group showed a greater preference for verbal control than for gestural control. The older adults group requires an adaptation period to acquire greater skill in using the interface. The young adults group preferred the types of interaction they are accustomed to, owing to their familiarity with Information and Communication Technologies (ICT); they considered the interface fun.

Implications for rehabilitation: Computers and their applications were conceived with unnatural interaction mechanisms, such as the keyboard and mouse, which prevent their use by people with psychophysiological limitations or limited digital literacy. For this reason, the need arises to design new natural interfaces commanded by gestures and voice. It is necessary to develop low-cost interfaces that can control mass-use applications such as Google Chrome, Facebook and Gmail without additional hardware, relying only on the webcams and microphones that are usually integrated into laptops. The development of these multimodal interfaces improves the quality of life of people with motor impairments by giving them access to Information and Communication Technologies (ICT).
Pages: 807-820
Page count: 14