A Historical Handwritten French Manuscripts Text Detection Method in Full Pages

被引:0
|
作者
Sang, Rui [1 ]
Zhao, Shili [2 ]
Meng, Yan [2 ]
Zhang, Mingxian [2 ]
Li, Xuefei [2 ]
Xia, Huijie [1 ]
Zhao, Ran [2 ]
机构
[1] North China Elect Power Univ, Sch Foreign Languages, Beijing 102206, Peoples R China
[2] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
关键词
French historical handwriting; complex text detection; feature enhancement; loss optimization;
D O I
10.3390/info15080483
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Historical handwritten manuscripts pose challenges to automated recognition techniques due to their unique handwriting styles and cultural backgrounds. In order to solve the problems of complex text word misdetection, omission, and insufficient detection of wide-pitch curved text, this study proposes a high-precision text detection method based on improved YOLOv8s. Firstly, the Swin Transformer is used to replace C2f at the end of the backbone network to solve the shortcomings of fine-grained information loss and insufficient learning features in text word detection. Secondly, the Dysample (Dynamic Upsampling Operator) method is used to retain more detailed features of the target and overcome the shortcomings of information loss in traditional upsampling to realize the text detection task for dense targets. Then, the LSK (Large Selective Kernel) module is added to the detection head to dynamically adjust the feature extraction receptive field, which solves the cases of extreme aspect ratio words, unfocused small text, and complex shape text in text detection. Finally, in order to overcome the CIOU (Complete Intersection Over Union) loss in target box regression with unclear aspect ratio, insensitive to size change, and insufficient correlation between target coordinates, Gaussian Wasserstein Distance (GWD) is introduced to modify the regression loss to measure the similarity between the two bounding boxes in order to obtain high-quality bounding boxes. Compared with the State-of-the-Art methods, the proposed method achieves optimal performance in text detection, with the precision and mAP@0.5 reaching 86.3% and 82.4%, which are 8.1% and 6.7% higher than the original method, respectively. The advancement of each module is verified by ablation experiments. The experimental results show that the method proposed in this study can effectively realize complex text detection and provide a powerful technical means for historical manuscript reproduction.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] End-to-End Handwritten Text Detection and Transcription in Full Pages
    Carbonell, Manuel
    Mas, Joan
    Villegas, Mauricio
    Fornes, Alicia
    Llados, Josep
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW), VOL 5, 2019, : 29 - 34
  • [2] An Evaluation of Handwritten Text Recognition Methods for Historical Ciphered Manuscripts
    Souibgui, Mohamed Ali
    Torras, Pau
    Chen, Jialuo
    Fornes, Alicia
    PROCEEDINGS OF THE 2023 INTERNATIONAL WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING, HIP 2023, 2023, : 7 - 12
  • [3] iForal: Automated Handwritten Text Transcription for Historical Medieval Manuscripts
    Matos, Alexandre
    Almeida, Pedro
    Correia, Paulo L.
    Pacheco, Osvaldo
    JOURNAL OF IMAGING, 2025, 11 (02)
  • [4] Text Segmentation of Historical Arabic Handwritten Manuscripts Using Projection Profile
    Alghamdi, Arwa
    Alluhaybi, Dareen
    Almehmadi, Doaa
    Alameer, Khadijah
    Bin Siddeq, Sundos
    Alsubait, Tahani
    2021 IEEE NATIONAL COMPUTING COLLEGES CONFERENCE (NCCC 2021), 2021, : 1012 - +
  • [5] Text Line Detection in Corrupted and Damaged Historical Manuscripts
    Rabaev, Irina
    Biller, Ofer
    El-Sana, Jihad
    Kedem, Klara
    Dinstein, Itshak
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 812 - 816
  • [6] Semiautomatic Text Baseline Detection in Large Historical Handwritten Documents
    Bosch, Vicente
    Hector Toselli, Alejandro
    Vidal, Enrique
    2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR), 2014, : 690 - 695
  • [7] Detection of Text Lines of Handwritten Arabic Manuscripts using Markov Decision Processes
    Boulid, Youssef
    Souhar, Abdelghani
    Elkettani, Mohamed Youssfi
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2016, 4 (01): : 31 - 36
  • [8] Album of 13th century French manuscripts. Arrangement of pages and text
    Tyssens, M
    MOYEN AGE, 2003, 109 (01): : 155 - 157
  • [9] Digitizing Cyrillic Manuscripts for the Historical Dictionary of the Serbian Language Using Handwritten Text Recognition Technology
    Polomac, Vladimir
    Kuresevic, Marina
    Bjelakovic, Isidora
    Jovanovic, Aleksandra Colic
    Petrovic, Sanja
    SLOVENE-INTERNATIONAL JOURNAL OF SLAVIC STUDIES, 2023, 12 (01): : 295 - 316
  • [10] Album of 13th century French manuscripts. Arranging the pages and composing the text
    Colombo Timelli, M
    STUDI FRANCESI, 2002, 46 (03) : 661 - 662