MyUEVision: an application generating image caption for assisting visually impaired people

被引:0
|
作者
Nguyen, Hung [1 ]
Huynh, Thai [2 ]
Tran, Nha [1 ]
Nguyen, Toan [1 ]
机构
[1] Ho Chi Minh City Univ Educ, Dept Comp Sci, Ho Chi Minh City, Vietnam
[2] Ho Chi Minh City Univ Educ, Ho Chi Minh City, Vietnam
关键词
Mobile application; API; Visual impaired; Image captioning; Cloud computing;
D O I
10.1108/JET-03-2024-0024
中图分类号
R49 [康复医学];
学科分类号
100215 ;
摘要
PurposeVisually impaired people usually struggle with doing daily tasks due to a lack of visual cues. For image captioning assistive applications, most applications require an Internet connection for the image captioning generation function to work properly. In this study, we developed MyUEVision, an application that assists visually impaired people by generating image captions that can work with and without the Internet. This work also involves reviewing some image captioning models for this application.Design/methodology/approachThe author has selected and experimented with three image captioning models for online models and two image captioning models for offline models. The user experience (UX) design was designed based on the problems faced by visually impaired users when using mobile applications. The application is developed for the Android platform, and the offline model is integrated into the application for the image captioning generation function to work offline.FindingsAfter conducting experiments for selecting online and offline models, ExpansionNet V2 is chosen for the online model and VGG16 + long short-term memory (LSTM) is chosen for the offline model. The application is then developed and assessed, and the results show that the application can generate image captions with or without the Internet, providing the best result when having an Internet connection, and the image is captured in good lighting with a few objects.Originality/valueMyUEVision stands out for its both online and offline functionality. This approach ensures the image captioning generator works with or without the Internet, setting it apart as a unique solution to address the needs of visually impaired individuals.
引用
收藏
页码:248 / 264
页数:17
相关论文
共 50 条
  • [21] Assistive mobile application for visually impaired people
    Nayak S.
    Chandrakala C.B.
    Chandrakala, C.B. (chandrakala.cb@manipal.edu), 1600, International Association of Online Engineering (14): : 52 - 69
  • [22] Navigation Assistive Application for the Visually Impaired People
    Oksana, Luchsheva
    Ihor, Turkin
    Pavlo, Luchshev
    2020 IEEE 11TH INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS, SERVICES AND TECHNOLOGIES (DESSERT): IOT, BIG DATA AND AI FOR A SAFE & SECURE WORLD AND INDUSTRY 4.0, 2020, : 320 - 325
  • [23] Describing Image Focused in Cognitive and Visual Details for Visually Impaired People: An Approach to Generating Inclusive Paragraphs
    Fernandes, Daniel L.
    Ribeiro, Marcos H. F.
    Cerqueira, Fabio R.
    Silva, Michel M.
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 526 - 534
  • [24] Utilizing Artificial Intelligence Techniques for Assisting Visually Impaired People: A Personal AI-based Assistive Application
    Alhazmi S.
    Kutbi M.
    Alhelaly S.
    Dawood U.
    Felemban R.
    Alaslani E.
    International Journal of Advanced Computer Science and Applications, 2022, 13 (08) : 813 - 820
  • [25] Image Sensing System for Navigating Visually Impaired People
    Gonnot, Thomas
    Saniie, Jafar
    2013 IEEE SENSORS, 2013, : 1847 - 1850
  • [26] A New Image Captioning Approach for Visually Impaired People
    Makav, Burak
    Kilic, Volkan
    2019 11TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO 2019), 2019, : 945 - 949
  • [27] Image to Audio Frequencies Modulation for Visually Impaired People
    Gonnot, Thomas
    Mikuta, Matthew
    Saniie, Jafar
    2017 IEEE INTERNATIONAL CONFERENCE ON ELECTRO INFORMATION TECHNOLOGY (EIT), 2017, : 291 - 294
  • [28] Image Edges Locator Dedicated to Visually Impaired People - an Experimental Application for Mobile Devices
    Pitera, Anna
    Bober, Dariusz
    Kesik, Jacek
    2015 IEEE 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS (IDAACS), VOLS 1-2, 2015, : 660 - 665
  • [29] Algorithms and techniques for image to sound conversion for helping the visually impaired people - Application proposal
    Cazan, Drd. Alexandru
    Varbanescu, Radu
    Popescu, Dan
    2007 14TH INTERNATIONAL WORKSHOP ON SYSTEMS, SIGNALS, & IMAGE PROCESSING & EURASIP CONFERENCE FOCUSED ON SPEECH & IMAGE PROCESSING, MULTIMEDIA COMMUNICATIONS & SERVICES, 2007, : 223 - 226
  • [30] Method for Generating Captions for Clothing Images to Support Visually Impaired People
    Tateno, Kiri
    Takagi, Noboru
    Sawai, Kei
    Masuta, Hiroyuki
    Motoyoshi, Tatsuo
    2020 JOINT 11TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 21ST INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS-ISIS), 2020, : 315 - 319