OCR Based Image Text To Speech Conversion Using MATLAB

被引：0

作者：

Madre, Sneha. C. ^{[1
]}

Gundre, S. B. ^{[1
]}

机构：

[1] Govt Coll Engn, Dept Elect & Telecommun, Aurangabad 431001, Maharashtra, India

来源：

PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS) | 2018年

关键词：

OCR; Segmentation; Text Extraction; Templates; TTS; MATLAB;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

There are millions of blind people in the world who are visually impaired. Disability to read has a large impact on the life of visually impaired people. The Proposed system is cost-efficient and helps the visually impaired person to hear the text. The main idea of this project is optical Character recognition which is used to convert text character into the audio signal. The text is preprocessed and then used for recognition by segmenting each character. Segmentation is followed by extraction of the letter and resizing of the file containing the text. This Text file is then converted into the audio signal. MATLAB16 will be used for all these processes mentioned above.

引用

页码：858 / 861

页数：4

共 50 条

[31] An efficient network for farsi text to speech conversion using vowel state
Rasekh, Ehsan
Eshghi, Mohammad
TENCON 2006 - 2006 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2006, : 176 - +
[32] EMOTIONAL VOICE CONVERSION USING MULTITASK LEARNING WITH TEXT-TO-SPEECH
Kim, Tae-Ho
Cho, Sungjae
Choi, Shinkook
Park, Sejik
Lee, Soo-Young
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7774 - 7778
[33] Development of a rule based approach for Bangla text to speech conversion system
Anam, A. S. M. Iftekhar
Osman, Sowkot
Chowdhury, Asif Jamil
Ali, Muhammad Masroor
ICECE 2006: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, 2006, : 181 - +
[34] Detecting Text Based Image With Optical Character Recognition for English Translation and Speech using Android
Ramiah, Sathiapriya
Liong, Tan Yu
Jayabalan, Manoj
2015 IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2015, : 272 - 277
[35] An Approach for Generating Pattern-Based Shorthand Using Speech-to-Text Conversion and Machine Learning
Abhinand, K.
Devi, H.
JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (03) : 229 - 240
[36] Text-to-speech conversion system for Brazilian Portuguese using a formant-based synthesis technique
Gomes, LDT
Nagle, EJ
Chiquito, JG
ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 219 - 224
[37] Pre-Training of DNN-Based Speech Synthesis Based on Bidirectional Conversion between Text and Speech
Sone, Kentaro
Nakashika, Toru
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (08) : 1546 - 1553
[38] COME: Clip-OCR and Master ObjEct for text image captioning
Lv, Gang
Sun, Yining
Nian, Fudong
Zhu, Maofei
Tang, Wenliang
Hu, Zhenzhen
IMAGE AND VISION COMPUTING, 2023, 136
[39] Disentangled OCR: A More Granular Information for "Text"-to-Image Retrieval
Zhou, Xinyu
Li, Shilin
Chen, Huen
Zhu, Anna
PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 510 - 523
[40] OCR-VQGAN: Taming Text-within-Image Generation
Rodriguez, Juan A.
Vazquez, David
Laradji, Issam
Pedersoli, Marco
Rodriguez, Pau
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3678 - 3687

← 1 2 3 4 5 →