A deep learning based system for writer identification in handwritten Arabic historical manuscripts

被引:0
|
作者
Michel Chammas
Abdallah Makhoul
Jacques Demerjian
Elie Dannaoui
机构
[1] University of Balamand,Digital Humanities Center
[2] Université de Bourgogne Franche-Comté,Femto
[3] Lebanese University,ST Institute, UMR CNRS 6174
来源
关键词
Writer identification; Historical documents; Artificial intelligence; Document analysis; Arabic manuscripts;
D O I
暂无
中图分类号
学科分类号
摘要
Determining the writer or transcriber of historical Arabic manuscripts has always been a major challenge for researchers in the field of humanities. With the development of advanced techniques in pattern recognition and machine learning, these technologies have been applied to automate the extraction of paleographical features in order to solve this issue. This paper presents a baseline system for writer identification, tested on a Historical Arabic dataset of 11610 single and double folio images. These texts were extracted from a unique collection of 567 Historical Arabic Manuscripts available at the Balamand Digital Humanities Center. A survey has been conducted on the available Arabic datasets and previously proposed techniques and algorithms. The Balamand dataset presents an important challenge due to the geo-historical identity of manuscripts and their physical conditions. An advanced Deep Learning system was developed and tested on three different Latin and Arabic datasets: ICDAR19, ICFHR20 and KHATT, before testing it on the Balamand dataset. The system was compared with many other systems and it has yielded a state-of-the-art performance on the new challenging images with 95.2% mean Average Precision (mAP) and 98.1% accuracy.
引用
收藏
页码:30769 / 30784
页数:15
相关论文
共 50 条
  • [31] Learning-based word spotting system for Arabic handwritten documents
    Khayyat, Muna
    Lam, Louisa
    Suen, Ching Y.
    PATTERN RECOGNITION, 2014, 47 (03) : 1021 - 1030
  • [32] A new Arabic handwritten character recognition deep learning system (AHCR-DLS)
    Balaha, Hossam Magdy
    Ali, Hesham Arafat
    Saraya, Mohamed
    Badawy, Mahmoud
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (11): : 6325 - 6367
  • [33] A new Arabic handwritten character recognition deep learning system (AHCR-DLS)
    Hossam Magdy Balaha
    Hesham Arafat Ali
    Mohamed Saraya
    Mahmoud Badawy
    Neural Computing and Applications, 2021, 33 : 6325 - 6367
  • [34] Writer Identification from Nordic Historical Manuscripts using Transformer Networks
    Adak, Chandranath
    Jaswanth, Batturi
    Akhtar, Zahid
    Kasen, Andre
    Chanda, Sukalpa
    2023 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS, IJCB, 2023,
  • [35] Fractal-Based System for Arabic/Latin, Printed/Handwritten Script Identification
    Ben Moussa, S.
    Zahour, A.
    Benabdelhafid, A.
    Alimi, A. M.
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2643 - 2646
  • [36] Offline Arabic handwritten character recognition: from conventional machine learning system to deep learning approaches
    Faouci, Soumia
    Gaceb, Djamel
    Haddad, Mohammed
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2022, 25 (04) : 385 - 398
  • [37] Deep Learning-Based Child Handwritten Arabic Character Recognition and Handwriting Discrimination
    Alwagdani, Maram Saleh
    Jaha, Emad Sami
    SENSORS, 2023, 23 (15)
  • [38] Exploring Deep Learning Approaches to Recognize Handwritten Arabic Texts
    Eltay, Mohamed
    Zidouri, Abdelmalek
    Ahmad, Irfan
    IEEE ACCESS, 2020, 8 : 89882 - 89898
  • [39] Hybrid Trainable System for Writer Identification of Arabic Handwriting
    Saleem, Saleem Ibraheem
    Abdulazeez, Adnan Mohsin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (03): : 3353 - 3372
  • [40] Writer Identification From Historical Documents Using Ensemble Deep Learning Transfer Models
    Babic, Radmila Jankovic
    Amelio, Alessia
    Draganov, Ivo R.
    2022 21ST INTERNATIONAL SYMPOSIUM INFOTEH-JAHORINA (INFOTEH), 2022,