An OCR System for Printed Nasta'liq Script: A Segmentation Based Approach

Authors
Naz, Saeeda [1 ,2 ]
Umar, Arif Iqbal [1 ,2 ]
Bin Ahmed, Saad [3 ]
Shirazi, Syed Hamad [1 ]
Razzak, M. Imran [3 ]
Siddiqi, Imran [4 ]
Affiliations
[1] Hazara Univ, Dept Informat Technol, Mansehra, Pakistan
[2] KPK, Higher Educ Dept, Shimla, Pakistan
[3] King Saud Bin Abdul Aziz Univ Hlth Sci, Riyadh, Saudi Arabia
[4] Bahria Univ Islamabad, Dept Comp Sci, Islamabad, Pakistan
Keywords
CHARACTER-RECOGNITION;
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Machine simulation of human reading has been a subject of intensive research for almost four decades. Although recent advances in recognition methods and systems for Latin script are very promising, automatic Urdu character recognition remains a challenging task because of the script's cursive nature. This work introduces a robust approach based on statistical models for recognizing Urdu text in the Nasta'liq style. In contrast to classical approaches, which segment text into words, ligatures, or characters, we intend to employ implicit segmentation, in which text lines are recognized during segmentation. The developed system will be evaluated on standard Urdu text databases and compared with the state-of-the-art recognition techniques proposed to date.
Pages: 255-259
Number of pages: 5