An augmented reality for an Arabic text reading and visualization assistant for the visually impaired

Cited by: 3
Authors
Ouali, Imene [1]
Ben Halima, Mohamed [1]
Wali, Ali [1]
Affiliations
[1] Univ Sfax, Natl Engn Sch Sfax ENIS, REGIM Res Grp Intelligent Machines, Sfax, Tunisia
Keywords
Text visualization; Text to speech; Text detection; Text recognition; Arabic; Augmented reality; Deep learning; Visually impaired; RECOGNITION
DOI
10.1007/s11042-023-14880-6
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Text, one of humanity's most influential innovations, has played an important role in shaping our lives. Reading text can be difficult for several reasons, such as luminosity, text orientation, writing style, and very light colors. The visually impaired have difficulty reading text in all of these situations. In particular, handwritten text is more difficult to read than digital text because handwriting forms and styles vary between writers and, sometimes, within the work of a single writer. Visually impaired readers would therefore benefit from a device or system that helps them solve this problem and improves their quality of life. Arabic language recognition and identification is a very difficult task because of diacritics such as consonant score, tashkil, and others. In this context, we propose a recognition and identification system for Arabic Handwritten Texts with Diacritics (AHTD) based on deep learning, using a convolutional neural network. The network is trained, tested, and validated on text images from our Arabic Handwritten Texts with a Diacritical Dataset (AHT2D). Then, the recognized text is enhanced with augmented reality technology and produced as a 2D image. Finally, the recognized text is converted into an audio output using AR technology. Voice output and visual output are given to the visually impaired user. The experimental results show that the proposed system is robust, with an accuracy rate of 95%.
Pages: 43569-43597
Number of pages: 29