Classification of incunable glyphs and out-of-distribution detection with joint energy-based models

被引:6
|
作者
Kordon, Florian [1 ]
Weichselbaumer, Nikolaus [2 ]
Herz, Randall [2 ]
Mossman, Stephen [3 ]
Potten, Edward [4 ]
Seuret, Mathias [1 ]
Mayr, Martin [1 ]
Christlein, Vincent [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Pattern Recognit Lab, Martensstr 3, D-91058 Erlangen, Germany
[2] Johannes Gutenberg Univ Mainz, Gutenberg Inst Weltliteratur & schriftorientierte, Jakob Welder Weg 18, D-55128 Mainz, Germany
[3] Univ Manchester, Sch Arts Languages & Cultures, Oxford Rd, Manchester M13 9PL, England
[4] Univ York, Ctr Medieval Studies, York YO1 7EP, England
基金
英国艺术与人文研究理事会;
关键词
Letterpress printing; Glyph extraction; Optical character recognition; Joint energy-based models; OOD detection; NETWORKS; PRODUCTS;
D O I
10.1007/s10032-023-00442-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical character recognition (OCR) has proved a powerful tool for the digital analysis of printed historical documents. However, its ability to localize and identify individual glyphs is challenged by the tremendous variety in historical type design, the physicality of the printing process, and the state of conservation. We propose to mitigate these problems by a downstream fine-tuning step that corrects for pathological and undesirable extraction results. We implement this idea by using a joint energy-based model which classifies individual glyphs and simultaneously prunes potential out-of-distribution (OOD) samples like rubrications, initials, or ligatures. During model training, we introduce specific margins in the energy spectrum that aid this separation and explore the glyph distribution's typical set to stabilize the optimization procedure. We observe strong classification at 0.972 AUPRC across 42 lower- and uppercase glyph types on a challenging digital reproduction of Johannes Balbus' Catholicon, matching the performance of purely discriminative methods. At the same time, we achieve OOD detection rates of 0.989 AUPRC and 0.946 AUPRC for OOD 'clutter' and 'ligatures' which substantially improves upon recently proposed OOD detection techniques. The proposed approach can be easily integrated into the postprocessing phase of current OCR to aid reproduction and shape analysis research.
引用
收藏
页码:223 / 240
页数:18
相关论文
共 50 条
  • [1] Classification of incunable glyphs and out-of-distribution detection with joint energy-based models
    Florian Kordon
    Nikolaus Weichselbaumer
    Randall Herz
    Stephen Mossman
    Edward Potten
    Mathias Seuret
    Martin Mayr
    Vincent Christlein
    International Journal on Document Analysis and Recognition (IJDAR), 2023, 26 : 223 - 240
  • [2] Exploring Energy-Based Models for Out-of-Distribution Detection in Dialect Identification
    Hao, Yaqian
    Hu, Chenguang
    Gao, Yingying
    Zhang, Shilei
    Feng, Junlan
    INTERSPEECH 2024, 2024, : 1640 - 1644
  • [3] Secure Out-of-Distribution Task Generalization with Energy-Based Models
    Chen, Shengzhuang
    Huang, Long-Kai
    Schwarz, Jonathan Richard
    Du, Yilun
    Wei, Ying
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] Improving Energy-Based Out-of-Distribution Detection by Sparsity Regularization
    Chen, Qichao
    Jiang, Wenjie
    Li, Kuan
    Wang, Yi
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT II, 2022, 13281 : 539 - 551
  • [5] Adversarial Training on Joint Energy Based Model for Robust Classification and Out-of-Distribution Detection
    Lee, Kyungmin
    Yang, Hunmin
    Oh, Se-Yoon
    2020 20TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2020, : 17 - 21
  • [6] Unsupervised Energy-based Out-of-distribution Detection using Stiefel-Restricted Kernel Machine
    Tonin, Francesco
    Pandey, Arun
    Patrinos, Panagiotis
    Suykens, Johan A. K.
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [7] Concurrent Misclassification and Out-of-Distribution Detection for Semantic Segmentation via Energy-Based Normalizing Flow
    Gudovskiy, Denis
    Okuno, Tomoyuki
    Nakata, Yohei
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 745 - 755
  • [8] Semantic Driven Energy based Out-of-Distribution Detection
    Joshi, Abhishek
    Chalasani, Sathish
    Iyer, Kiran Nanjunda
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [9] Latent Transformer Models for out-of-distribution detection
    Graham, Mark S.
    Tudosiu, Petru-Daniel
    Wright, Paul
    Pinaya, Walter Hugo Lopez
    Teikari, Petteri
    Patel, Ashay
    U-King-Im, Jean-Marie
    Mah, Yee H.
    Teo, James T.
    Jager, Hans Rolf
    Werring, David
    Rees, Geraint
    Nachev, Parashkev
    Ourselin, Sebastien
    Cardoso, M. Jorge
    MEDICAL IMAGE ANALYSIS, 2023, 90
  • [10] Language Models as Reasoners for Out-of-Distribution Detection
    Kirchheim, Konstantin
    Ortmeier, Frank
    COMPUTER SAFETY, RELIABILITY, AND SECURITY. SAFECOMP 2024 WORKSHOPS, 2024, 14989 : 379 - 390