A Historical Handwritten French Manuscripts Text Detection Method in Full Pages

被引:0
|
作者
Sang, Rui [1 ]
Zhao, Shili [2 ]
Meng, Yan [2 ]
Zhang, Mingxian [2 ]
Li, Xuefei [2 ]
Xia, Huijie [1 ]
Zhao, Ran [2 ]
机构
[1] North China Elect Power Univ, Sch Foreign Languages, Beijing 102206, Peoples R China
[2] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
关键词
French historical handwriting; complex text detection; feature enhancement; loss optimization;
D O I
10.3390/info15080483
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Historical handwritten manuscripts pose challenges to automated recognition techniques due to their unique handwriting styles and cultural backgrounds. In order to solve the problems of complex text word misdetection, omission, and insufficient detection of wide-pitch curved text, this study proposes a high-precision text detection method based on improved YOLOv8s. Firstly, the Swin Transformer is used to replace C2f at the end of the backbone network to solve the shortcomings of fine-grained information loss and insufficient learning features in text word detection. Secondly, the Dysample (Dynamic Upsampling Operator) method is used to retain more detailed features of the target and overcome the shortcomings of information loss in traditional upsampling to realize the text detection task for dense targets. Then, the LSK (Large Selective Kernel) module is added to the detection head to dynamically adjust the feature extraction receptive field, which solves the cases of extreme aspect ratio words, unfocused small text, and complex shape text in text detection. Finally, in order to overcome the CIOU (Complete Intersection Over Union) loss in target box regression with unclear aspect ratio, insensitive to size change, and insufficient correlation between target coordinates, Gaussian Wasserstein Distance (GWD) is introduced to modify the regression loss to measure the similarity between the two bounding boxes in order to obtain high-quality bounding boxes. Compared with the State-of-the-Art methods, the proposed method achieves optimal performance in text detection, with the precision and mAP@0.5 reaching 86.3% and 82.4%, which are 8.1% and 6.7% higher than the original method, respectively. The advancement of each module is verified by ablation experiments. The experimental results show that the method proposed in this study can effectively realize complex text detection and provide a powerful technical means for historical manuscript reproduction.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Text Line Extraction using DMLP Classifiers for Historical Manuscripts
    Baechler, Micheal
    Liwicki, Marcus
    Ingold, Rolf
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1029 - 1033
  • [42] ICFHR 2016 Competition on the Analysis of Handwritten Text in Images of Balinese Palm Leaf Manuscripts
    Burie, Jean-Christophe
    Coustaty, Mickael
    Hadi, Setiawan
    Kesiman, Made Windu Antara
    Ogler, Jean -Marc
    Paulus, Erick
    Sok, Kimheng
    Sunarya, I. Made Gede
    Valy, Dona
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 596 - 601
  • [43] Character and Text Recognition of Khmer Historical Palm Leaf Manuscripts
    Valy, Dona
    Verleysen, Michel
    Chhun, Sophea
    Burie, Jean-Christophe
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 13 - 18
  • [44] Hybrid Binarization Method for Historical Handwritten Documents
    Asatryan, D. G.
    Haroutunian, M. E.
    Sazhumyan, G. S.
    Kupriyanov, A. V.
    Paringer, R. A.
    Kirsh, D. V.
    PROGRAMMING AND COMPUTER SOFTWARE, 2023, 49 (SUPPL 1) : S45 - S50
  • [45] Hybrid Binarization Method for Historical Handwritten Documents
    D. G. Asatryan
    M. E. Haroutunian
    G. S. Sazhumyan
    A. V. Kupriyanov
    R. A. Paringer
    D. V. Kirsh
    Programming and Computer Software, 2023, 49 : S45 - S50
  • [46] PROUST AND THE PRODUCTIVE TEXT - FRENCH - ERICKSON,JD, PAGES,I
    STRAUSS, WA
    MODERN LANGUAGE JOURNAL, 1982, 66 (01): : 92 - 93
  • [47] Separating lines of text in free-form handwritten historical documents
    Kennard, Douglas J.
    Barrett, William A.
    SECOND INTERNATIONAL CONFERENCE ON DOCUMENT IMAGE ANALYSIS FOR LIBRARIES, PROCEEDINGS, 2006, : 12 - +
  • [48] Learning-Free Text Line Segmentation for Historical Handwritten Documents
    Barakat, Berat Kurar
    Cohen, Rafi
    Droby, Ahmad
    Rabaev, Irina
    El-Sana, Jihad
    APPLIED SCIENCES-BASEL, 2020, 10 (22): : 1 - 19
  • [49] A new robust approach for altered handwritten text detection
    Patil, Gayatri
    Shivakumara, Palaiahnakote
    Gornale, Shivanand S.
    Pal, Umapada
    Blumenstein, Michael
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (14) : 20925 - 20949
  • [50] A Robust Text Line Detection in Complex Handwritten Documents
    Pach, Jakub Leszek
    Bilski, Piotr
    2015 IEEE 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS (IDAACS), VOLS 1-2, 2015, : 271 - 275