HeritageScript: A cutting-edge approach to historical manuscript script classification with CNN and vision transformer architectures

Cited by: 0
Authors
Bennour, Akram [1]
Boudraa, Merouane [1]
Ghabban, Fahad [2]
Affiliations
[1] Echahid Cheikh Larbi Tebessi Univ, Lab Math Informat & Syst LAMIS, Tebessa 12002, Algeria
[2] Taibah Univ, Coll Comp Sci & Engn, Medina, Saudi Arabia
Source
Keywords
Script-classification; historical-manuscripts; deep-learning; CNNs; vision-transformers; transfer-learning; COMPETITION; TEXTURE
DOI
10.3233/IDT-240565
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Determining the script of historical manuscripts is pivotal for understanding historical narratives, providing historians with vital insights into the past. In this study, we develop an automated system for identifying the script of historical documents using a deep learning approach. The system is built on the CLaMM dataset and begins with two preprocessing steps: denoising with non-local means and binarization using Canny edge detection. These steps prepare each document for keypoint detection with the Harris corner detector, a feature-detection method. We then cluster the keypoints with the k-means algorithm and extract patches based on the identified features. Finally, we train deep learning models on these patches and compare two architectures: Convolutional Neural Networks (CNN) and Vision Transformers (ViT). Because no prior studies have investigated the performance of vision transformers on historical manuscripts, our research fills this gap. The system undergoes a series of experiments to fine-tune its parameters for optimal performance. The CNN-based and ViT-based frameworks achieve average accuracies of 89.2% and 91.99%, respectively, surpassing the state of the art in historical script classification so far and affirming the effectiveness of our automated script identification system.
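The keypoint clustering and patch extraction steps described in the abstract might look roughly like the following minimal sketch. It is not the authors' implementation: it assumes keypoints have already been produced by a corner detector such as Harris, and the function names (`kmeans`, `extract_patches`), the number of clusters, and the 16-pixel patch size are all hypothetical choices made for illustration. Denoising, binarization, and model training are omitted.

```python
import random

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means on 2-D (x, y) points; returns k centroids."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct keypoints
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for x, y in points:
            # Assign each keypoint to its nearest centroid (squared distance).
            nearest = min(
                range(k),
                key=lambda c: (x - centroids[c][0]) ** 2 + (y - centroids[c][1]) ** 2,
            )
            clusters[nearest].append((x, y))
        updated = []
        for j, members in enumerate(clusters):
            if members:
                updated.append((
                    sum(p[0] for p in members) / len(members),
                    sum(p[1] for p in members) / len(members),
                ))
            else:
                updated.append(centroids[j])  # keep an empty cluster's centroid
        if updated == centroids:
            break  # assignments are stable
        centroids = updated
    return centroids

def extract_patches(image, centers, size):
    """Crop one size x size patch per centroid, shifted to stay inside the image.

    `image` is a list of rows and must be at least `size` pixels in each dimension.
    """
    height, width = len(image), len(image[0])
    half = size // 2
    patches = []
    for cx, cy in centers:
        left = min(max(int(round(cx)) - half, 0), width - size)
        top = min(max(int(round(cy)) - half, 0), height - size)
        patches.append([row[left:left + size] for row in image[top:top + size]])
    return patches

# Hypothetical inputs: a synthetic 64x64 grayscale image and six keypoints
# that stand in for corner-detector output on a manuscript page.
image = [[(x + y) % 256 for x in range(64)] for y in range(64)]
keypoints = [(5, 5), (6, 7), (8, 6), (50, 52), (52, 50), (51, 49)]

centers = kmeans(keypoints, k=2)
patches = extract_patches(image, centers, size=16)
```

Clustering before cropping yields one patch per dense group of keypoints rather than one per keypoint, which keeps the number of training patches per page bounded. How the paper chooses the cluster count and patch size is not stated in the abstract.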
Pages: 2055-2078
Page count: 24