An Ocr System For Printed Nasta'liq Script: A Segmentation Based Approach

被引:0
|
作者
Naz, Saeeda [1 ,2 ]
Umar, Arif Iqbal [1 ,2 ]
Bin Ahmed, Saad [3 ]
Shirazi, Syed Hamad [1 ]
Razzak, M. Imran [3 ]
Siddiqi, Imran [4 ]
机构
[1] Hazara Univ, Dept Informat Technol, Mansehra, Pakistan
[2] KPK, Higher Educ Dept, Shimla, Pakistan
[3] King Saud Bin Abdul Aziz Univ Hlth Sci, Riyadh, Saudi Arabia
[4] Bahria Univ Islamabad, Dept Comp Sci, Islamabad, Pakistan
关键词
CHARACTER-RECOGNITION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Machine simulation of human reading has been a subject of intensive research for almost four decades. Automatic Urdu character recognition remains a challenging task due to its cursive nature despite the fact that the latest improvements in recognition methods and systems for Latin script are very promising. This work introduces a robust approach based on statistical models that provide solution for recognition of Urdu text Nasta'liq style. Contrary to classical approaches which segment text into words, ligatures or characters, we intend to employ an implicit segmentation where text lines are recognized during segmentation. The developed system will be evaluated on standard Urdu text databases and compared with the state-of-the-art recognition techniques proposed till date.
引用
收藏
页码:255 / 259
页数:5
相关论文
共 50 条
  • [1] The Role of the Safavids in the Spread and Institutionalization of the Nasta liq Script
    Shahidani, Shahab
    JOURNAL OF PERSIANATE STUDIES, 2024, 17 (1-2) : 149 - 188
  • [2] Arabic Script Based Language Character Recognition: Nasta'liq vs Naskh Analysis
    Naz, Saeeda
    Hayat, Khizar
    Razzak, Muhammad Imran
    Anwar, Muhammad Waqas
    Akbar, Habib
    WORLD CONGRESS ON COMPUTER & INFORMATION TECHNOLOGY (WCCIT 2013), 2013,
  • [3] Segmentation of Nastaliq Script for OCR
    Sattar, Sohail A.
    Hague, Shamsul
    Pathan, Mahmood K.
    COMPUTING & INFORMATICS, 2009, : 264 - 267
  • [4] Entropy-based skew detection and correction for printed Meitei/Meetei script OCR system
    Khuman, Yanglem Loijing Khomba
    Devi, H. Mamata
    Singh, N. Ajith
    MATERIALS TODAY-PROCEEDINGS, 2021, 37 : 2666 - 2669
  • [5] Segmentation of touching symbols for OCR of printed mathematical expressions: An approach based on multifactorial analysis
    Garain, U
    Chaudhuri, BB
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 177 - 181
  • [6] A complete OCR for printed Hindi text in Devanagari script
    Bansal, V
    Sinha, RMK
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 800 - 804
  • [7] A Segmentation-Free Approach for Printed Devanagari Script Recognition
    Karayil, Tushar
    Ul-Hasan, Adnan
    Breuel, Thomas M.
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 946 - 950
  • [8] An OCR System for the Meetei Mayek Script
    Ghosh, Subhankar
    Barman, Ujjwol
    Bora, P. K.
    Singh, Tourangbam Harishore
    Chaudhuri, B. B.
    2013 FOURTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2013,
  • [9] Text and graphics segmentation of newspapers printed in Gurmukhi script: a hybrid approach
    Kaur, Rupinder Pal
    Jindal, M. K.
    Kumar, Munish
    VISUAL COMPUTER, 2021, 37 (07): : 1637 - 1659
  • [10] Text and graphics segmentation of newspapers printed in Gurmukhi script: a hybrid approach
    Rupinder Pal Kaur
    M. K. Jindal
    Munish Kumar
    The Visual Computer, 2021, 37 : 1637 - 1659