Handwritten text recognition and information extraction from ancient manuscripts using deep convolutional and recurrent neural network

Author
El Bahi, Hassan [1]
Affiliation
[1] L2IS, Laboratory of Computer and Systems Engineering, Cadi Ayyad University, B.P. 511, Marrakech, 40000, Morocco
Keywords
Deep neural networks - Long short-term memory - Multilayer neural networks - Palmprint recognition
DOI
10.1007/s00500-024-09930-6
Abstract
Digitizing ancient manuscripts and making them accessible to a broader audience is a crucial step in unlocking the wealth of information they hold. However, automatic recognition of handwritten text and extraction of relevant information, such as named entities, from these manuscripts remain among the most difficult research problems because of factors such as poor manuscript quality, complex backgrounds, ink stains, and cursive handwriting. To meet these challenges, we propose two systems. The first performs handwritten text recognition (HTR) in ancient manuscripts. It starts with a preprocessing operation. Then, a convolutional neural network (CNN) extracts features from each input image. Finally, a recurrent neural network (RNN) with Long Short-Term Memory (LSTM) blocks and a Connectionist Temporal Classification (CTC) layer predicts the text contained in the image. The second system recognizes named entities and the relationships among words directly from images of old manuscripts, bypassing the intermediate text transcription step. Like the first system, it starts with a preprocessing step. Data augmentation is then used to enlarge the training dataset. After that, a CNN model automatically extracts the most relevant features. Finally, a bidirectional LSTM recognizes the named entities and the relationships between word images. Extensive experiments on the ESPOSALLES dataset demonstrate that the proposed systems achieve state-of-the-art performance, exceeding existing systems. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
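The abstract says that a CTC layer turns the per-frame outputs of the LSTM into text. As a rough illustration of that last step only, the sketch below shows greedy (best-path) CTC decoding in plain Python. It is not the paper's implementation: the function name ctc_greedy_decode, the choice of index 0 for the blank symbol, the two-letter alphabet, and the toy scores are all assumptions made for this example, and the paper may use a different decoding strategy.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank symbol (hypothetical convention)


def ctc_greedy_decode(frame_scores, alphabet, blank=BLANK):
    """Greedy (best-path) CTC decoding.

    frame_scores: one list of per-class scores for each time step,
                  where class 0 is the blank and class i maps to alphabet[i - 1].
    alphabet:     string of output characters (hypothetical, for illustration).
    """
    # 1. Pick the highest-scoring class at every time step.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # 2. Collapse runs of identical consecutive labels into a single label.
    collapsed = [label for label, _ in groupby(best_path)]
    # 3. Drop blanks and map the remaining labels to characters.
    return "".join(alphabet[label - 1] for label in collapsed if label != blank)


# Toy example: five time steps over the classes (blank, 'a', 'b').
frames = [
    [0.1, 0.8, 0.1],  # 'a'
    [0.1, 0.8, 0.1],  # 'a' again, merged with the previous frame
    [0.9, 0.05, 0.05],  # blank, separates the two 'a' characters
    [0.1, 0.7, 0.2],  # 'a'
    [0.1, 0.2, 0.7],  # 'b'
]
decoded = ctc_greedy_decode(frames, "ab")
```

The blank frame is what lets the decoder emit a doubled letter: without it, the three 'a' frames would collapse into a single character.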
Pages: 12249-12268
Page count: 19