An augmented reality for an arabic text reading and visualization assistant for the visually impaired

被引：3

作者：

Ouali, Imene ^{[1
]}

Ben Halima, Mohamed ^{[1
]}

Wali, Ali ^{[1
]}

机构：

[1] Univ Sfax, Natl Engn Sch Sfax ENIS, REGIM Res Grp Intelligent Machines, Sfax, Tunisia

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2023年 / 82卷 / 28期

关键词：

Text visualization; Text to speech; Text detection; Text recognition; Arabic; Augmented reality; Deep learning; Visually impaired; RECOGNITION;

D O I：

10.1007/s11042-023-14880-6

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Text, as one of humanity's most influential innovations, has played an important role in shaping our lives. Reading a text is a difficult task due to several reasons factors, such as luminosity, text orientation, writing style, and very light colors. However, the visually impaired, on the other hand, have difficulty reading a text in all of these situations. In particular, a handwritten text is more difficult to read than a digital text due to the different forms and styles of the handwriting of different writers or, sometimes, of the same writer. Therefore, they would benefit from a device or a system to help them to solve this problem and improve their quality of life. Arabic language recognition and identification is a very difficult task because of diacritics such as consonant score, tashkil, and others. In this context, we propose a recognition and identification system for Arabic Handwritten Texts with Diacritics (AHTD) based on deep learning by using the convolutional neural network. Text images are trained, tested, and validated with our Arabic Handwritten Texts with a Diacritical Dataset (AHT2D). Then, the recognized text is enhanced with augmented reality technology and produced as a 2D image. Finally, the recognized text is converted into an audio output using AR technology. Voice output and visual output are given to the visually impaired user. The experimental results show that the proposed system is robust, with an accuracy rate of 95%.

引用

页码：43569 / 43597

页数：29

共 50 条

[31] TEXT STRING DETECTION FOR THE FIRST GRADE VISUALLY IMPAIRED PUPILS "READING MANDARIN TEXTBOOKS"
Cheng, Ching-Ching
Tseng, Teng-Hui
Tsai, Chun-Ming
PROCEEDINGS OF 2014 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2014, : 377 - 382
[32] ENHANCEMENT OF TEXT FOR THE VISUALLY-IMPAIRED
FINE, EM
PELI, E
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1995, 12 (07): : 1439 - 1447
[33] Narrative Visualization with Augmented Reality
Marques, Ana Beatriz
Branco, Vasco
Costa, Rui
MULTIMODAL TECHNOLOGIES AND INTERACTION, 2022, 6 (12)
[34] Collaborative visualization in augmented reality
Fuhrmann, A
Loffelmann, H
Schmalstieg, D
Gervautz, M
IEEE COMPUTER GRAPHICS AND APPLICATIONS, 1998, 18 (04) : 54 - 59
[35] PeriText: Utilizing Peripheral Vision for Reading Text on Augmented Reality Smart Glasses
Ku, Pin-Sung
Peng, Yi-Hao
Lin, Yu-Chih
Chen, Mike Y.
2019 26TH IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES (VR), 2019, : 630 - 635
[36] Comprehensible Visualization for Augmented Reality
Kalkofen, Denis
Mendez, Erick
Schmalstieg, Dieter
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2009, 15 (02) : 193 - 204
[37] Deep Learning Based Shopping Assistant For The Visually Impaired
Pintado, Daniel
Sanchez, Vanessa
Adarve, Erin
Mata, Mark
Gogebakan, Zekeriya
Cabuk, Bunyamin
Chiu, Carter
Zhan, Justin
Gewali, Laxmi
Oh, Paul
2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2019,
[38] SRAVIP: Smart Robot Assistant for Visually Impaired Persons
Albogamy, Fahad
Alotaibi, Turk
Alhawdan, Ghalib
Faisal, Mohammed
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (07) : 345 - 352
[39] PeriTextAR: Utilizing Peripheral Vision for Reading Text on Augmented Reality Smart Glasses
Lin, Yu-Chih
Hsu, Leon
Chen, Mike Y.
24TH ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY (VRST 2018), 2018,
[40] Image processing with CNN in a FPGA-based augmented reality system for visually impaired people
Toledo, FJ
Martínez, JJ
Garrigós, FJ
Ferrández, JM
COMPUTATIONAL INTELLIGENCE AND BIOINSPIRED SYSTEMS, PROCEEDINGS, 2005, 3512 : 906 - 912

← 1 2 3 4 5 →