OCR Based Image Text To Speech Conversion Using MATLAB

Cited by: 0
Authors
Madre, Sneha. C. [1]
Gundre, S. B. [1]
Affiliations
[1] Govt Coll Engn, Dept Elect & Telecommun, Aurangabad 431001, Maharashtra, India
Source
PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS) | 2018
Keywords
OCR; Segmentation; Text Extraction; Templates; TTS; MATLAB;
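DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract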
Millions of people worldwide are blind or visually impaired, and the inability to read has a large impact on their lives. The proposed system is cost-efficient and helps a visually impaired person hear text. The main idea of this project is optical character recognition (OCR), which is used to convert text characters into an audio signal. The text is preprocessed and then segmented into individual characters for recognition. Segmentation is followed by extraction of each letter and resizing of the file containing the text. The resulting text file is then converted into an audio signal. MATLAB16 is used for all of the processes mentioned above.
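The abstract names the stages (segmentation, letter extraction, resizing) and the keywords mention templates, but this record does not reproduce the authors' MATLAB code. The following is only a minimal Python sketch of how such a template-matching pipeline might look, under stated assumptions: the input is an already binarized single text line, characters are separated by empty columns, and the function names, the 5x3 template size, and the three toy templates are all hypothetical and chosen for illustration. The final text-to-speech step is not shown, since it depends on the speech engine available on the platform.

```python
from typing import Dict, List

Image = List[List[int]]  # 1 = ink, 0 = background


def parse(rows: List[str]) -> Image:
    """Turn rows of '#' and '.' into a binary image (illustrative helper)."""
    return [[1 if c == "#" else 0 for c in r] for r in rows]


def segment_columns(img: Image) -> List[Image]:
    """Split a binarized text line into glyphs at empty columns."""
    width = len(img[0])
    ink = [any(row[x] for row in img) for x in range(width)]
    glyphs, start = [], None
    for x in range(width + 1):
        filled = x < width and ink[x]
        if filled and start is None:
            start = x  # a glyph begins
        elif not filled and start is not None:
            glyphs.append([row[start:x] for row in img])  # a glyph ends
            start = None
    return glyphs


def crop_rows(glyph: Image) -> Image:
    """Drop empty rows above and below the glyph."""
    rows = [y for y, row in enumerate(glyph) if any(row)]
    return glyph[rows[0]:rows[-1] + 1]


def resize(glyph: Image, h: int, w: int) -> Image:
    """Nearest-neighbour resize to the template size."""
    gh, gw = len(glyph), len(glyph[0])
    return [[glyph[y * gh // h][x * gw // w] for x in range(w)]
            for y in range(h)]


def match(glyph: Image, templates: Dict[str, Image]) -> str:
    """Return the label of the template that agrees on the most pixels."""
    def score(t: Image) -> int:
        return sum(a == b for gr, tr in zip(glyph, t) for a, b in zip(gr, tr))
    return max(templates, key=lambda k: score(templates[k]))


def recognize(img: Image, templates: Dict[str, Image], size=(5, 3)) -> str:
    """Segment, crop, resize, and match each glyph; return the text."""
    return "".join(match(resize(crop_rows(g), *size), templates)
                   for g in segment_columns(img))


# Toy 5x3 templates (assumed for this sketch, not taken from the paper).
TEMPLATES = {
    "I": parse(["###", ".#.", ".#.", ".#.", "###"]),
    "L": parse(["#..", "#..", "#..", "#..", "###"]),
    "T": parse(["###", ".#.", ".#.", ".#.", ".#."]),
}

# One text line containing the glyphs L, I, T separated by blank columns.
line = parse([
    "#...###.###",
    "#....#...#.",
    "#....#...#.",
    "#....#...#.",
    "###.###..#.",
])

print(recognize(line, TEMPLATES))  # prints LIT
```

In a complete system the recognized string would then be handed to a speech synthesizer. Resizing every glyph to the template size before comparison is what lets the same templates match text scanned at different scales.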
Pages: 858 - 861
Number of pages: 4
Related papers
50 in total
  • [31] An efficient network for farsi text to speech conversion using vowel state
    Rasekh, Ehsan
    Eshghi, Mohammad
    TENCON 2006 - 2006 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2006, : 176 - +
  • [32] EMOTIONAL VOICE CONVERSION USING MULTITASK LEARNING WITH TEXT-TO-SPEECH
    Kim, Tae-Ho
    Cho, Sungjae
    Choi, Shinkook
    Park, Sejik
    Lee, Soo-Young
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7774 - 7778
  • [33] Development of a rule based approach for Bangla text to speech conversion system
    Anam, A. S. M. Iftekhar
    Osman, Sowkot
    Chowdhury, Asif Jamil
    Ali, Muhammad Masroor
    ICECE 2006: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, 2006, : 181 - +
  • [34] Detecting Text Based Image With Optical Character Recognition for English Translation and Speech using Android
    Ramiah, Sathiapriya
    Liong, Tan Yu
    Jayabalan, Manoj
    2015 IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2015, : 272 - 277
  • [35] An Approach for Generating Pattern-Based Shorthand Using Speech-to-Text Conversion and Machine Learning
    Abhinand, K.
    Devi, H.
    JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (03) : 229 - 240
  • [36] Text-to-speech conversion system for Brazilian Portuguese using a formant-based synthesis technique
    Gomes, LDT
    Nagle, EJ
    Chiquito, JG
    ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 219 - 224
  • [37] Pre-Training of DNN-Based Speech Synthesis Based on Bidirectional Conversion between Text and Speech
    Sone, Kentaro
    Nakashika, Toru
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (08) : 1546 - 1553
  • [38] COME: Clip-OCR and Master ObjEct for text image captioning
    Lv, Gang
    Sun, Yining
    Nian, Fudong
    Zhu, Maofei
    Tang, Wenliang
    Hu, Zhenzhen
    IMAGE AND VISION COMPUTING, 2023, 136
  • [39] Disentangled OCR: A More Granular Information for "Text"-to-Image Retrieval
    Zhou, Xinyu
    Li, Shilin
    Chen, Huen
    Zhu, Anna
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 510 - 523
  • [40] OCR-VQGAN: Taming Text-within-Image Generation
    Rodriguez, Juan A.
    Vazquez, David
    Laradji, Issam
    Pedersoli, Marco
    Rodriguez, Pau
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3678 - 3687