Improved linear density technique for segmentation in Arabic handwritten text recognition

被引:2
|
作者
Al Hamad, Husam Ahmed [1 ]
Abualigah, Laith [1 ]
Shehab, Mohammad [2 ]
Al-Shqeerat, Khalil H. A. [3 ]
Otair, Mohammad [1 ]
机构
[1] Amman Arab Univ, Fac Comp Sci & Informat, Amman 11953, Jordan
[2] World Islamic Sci & Educ Univ, Informat Technol, Amman, Jordan
[3] Qassim Univ, Comp Sci Dept, Buraydah, Saudi Arabia
关键词
Arabic handwritten; Image segmentation; Image processing; Arabic handwriting; Text recognition; Vertical linear density;
D O I
10.1007/s11042-022-12717-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The challenge in handwriting recognition, especially in the segmentation process, took the researchers' attention. These Arabic handwritten text processes are a challenging job because their characters are generally both cursive and unconstrained. In this paper, a new segmentation technique is proposed for solving the problem of Arabic handwritten scripts, called ILDT. The proposed technique's main objective is to use the word image's vertical linear density for clarifying character boundaries and districting between characters. In the proposed method, three pre-processing steps are applied: fill close and open holes (missing circle), remove punctuation to clarify the area of ligature points and avoid characters overlapping, and crop the word image to remove excess white space. The goal of filling close and open holes is to increase the character's pixel density and then apply the vertical linear density. The proposed technique calculates the distance histogram of vertical linear, aiming to discover local minima points to precisely determine the segmentation points. Several experiments were conducted, including elapsed CPU times and accuracies values. All comparative techniques are examined on a local benchmark database. The proposed method (ILDT) got almost all the best segmentation and recognition accuracy compared with other comparative methods.
引用
收藏
页码:28531 / 28558
页数:28
相关论文
共 50 条
  • [21] An Efficient Segmentation Algorithm for Arabic Handwritten Characters Recognition System
    Fadeel, Mohamed A.
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON MATHEMATICS AND COMPUTERS IN SCIENCES AND IN INDUSTRY (MCSI 2016), 2016, : 172 - 177
  • [22] An Efficient Segmentation Algorithm for Arabic Handwritten Characters Recognition System
    Ali, Mohamed A.
    AFRO-EUROPEAN CONFERENCE FOR INDUSTRIAL ADVANCEMENT, AECIA 2014, 2015, 334 : 193 - 204
  • [23] Offine automatic segmentation based recognition of handwritten Arabic words
    Dinges, Laslo
    Al-Hamadi, Ayoub
    Elzobi, Moftah
    Al Aghbari, Zaher
    Mustafa, Hassan
    International Journal of Signal Processing, Image Processing and Pattern Recognition, 2011, 4 (04) : 131 - 144
  • [24] Segmentation of Arabic Handwritten Documents into Text Lines using Watershed Transform
    Souhar, A.
    Boulid, Y.
    Ameur, ElB.
    Ouagague, Mly. M.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2017, 4 (06): : 96 - 102
  • [25] A method for text-line segmentation for unconstrained Arabic and Persian handwritten text image
    Shakoori, Reza
    2014 IEEE 15TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2014, : 338 - 344
  • [26] Text Segmentation of Historical Arabic Handwritten Manuscripts Using Projection Profile
    Alghamdi, Arwa
    Alluhaybi, Dareen
    Almehmadi, Doaa
    Alameer, Khadijah
    Bin Siddeq, Sundos
    Alsubait, Tahani
    2021 IEEE NATIONAL COMPUTING COLLEGES CONFERENCE (NCCC 2021), 2021, : 1012 - +
  • [27] Handwritten Arabic Documents Segmentation into Text Lines using Seam Carving
    Daldali, M.
    Souhar, A.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2019, 5 (05): : 89 - 96
  • [28] An approach for handwritten Chinese text recognition unifying character segmentation and recognition
    Yu, Ming-Ming
    Zhang, Heng
    Yin, Fei
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2024, 151
  • [29] A transformer-based approach for Arabic offline handwritten text recognition
    Saleh Momeni
    Bagher BabaAli
    Signal, Image and Video Processing, 2024, 18 : 3053 - 3062
  • [30] Out of vocabulary word detection and recovery in Arabic handwritten text recognition
    Jemni, Sana Khamekhem
    Kessentini, Yousri
    Kanoun, Slim
    PATTERN RECOGNITION, 2019, 93 : 507 - 520