A combined strategy of analysis for the localization of heterogeneous form fields in ancient pre-printed records

Cited by: 0
Authors
Aurélie Lemaitre
Jean Camillerapp
Cérès Carton
Bertrand Coüasnon
Affiliations
[1] Univ Rennes - CNRS - IRISA,
Keywords
Historical documents; Field localization; Heterogeneous layout; Rule-based system; Word spotting; Unsupervised clustering;
DOI
Not available
Abstract
This paper addresses the localization of handwritten fields in old pre-printed registers. The images exhibit the usual difficulties of old, damaged documents, and text extraction is further complicated by the strong interaction between handwritten and printed text. In addition, in many collections the structure of the forms varies with the origin of the documents. This work is applied to a database of Mexican marriage records, which was published for a competition at the HIP 2013 workshop and is publicly available. We first show the benefits and limitations of the empirical method that was submitted to the competition. We then present a method that combines a logical description of the document contents with the results of an automatic analysis of the physical properties of the collection. A distinctive feature of this analysis is that it requires no ground truth. We show that this combined strategy locates 97.2% of the handwritten fields. The proposed approach is generalizable and could be applied to other databases.
Pages: 269-282
Number of pages: 13
Related papers
12 in total
  • [1] A combined strategy of analysis for the localization of heterogeneous form fields in ancient pre-printed records
    Lemaitre, Aurelie
    Camillerapp, Jean
    Carton, Ceres
    Couasnon, Bertrand
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (04) : 269 - 282
  • [2] Pre-Printed Form Recognition and Extraction of Data
    Dar, Mehraj-ud-Din
    Nagabhushan, P.
    Mir, A. H.
    INTERNATIONAL ELECTRONIC CONFERENCE ON COMPUTER SCIENCE, 2008, 1060 : 233 - +
  • [3] ANALYSIS OF VOLATILES FROM PRE-PRINTED FORMS
    WITT, JD
    WILCOX, CD
    LOY, DA
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 184 (SEP) : 93 - ANYL
  • [4] Pre-printed and Hand-filled Table-Form Analysis aiming Cell Extraction
    Felipe, Rafaela Dandolini
    Pereira Neves, Luiz Antonio
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 439 - +
  • [5] Pre-Printed Orders For Thromboprophylaxis In The Icu: A Cross-Sectional Analysis
    Centofanti, J. E.
    Zytaruk, N.
    Foster, D.
    Martinka, G.
    Hand, L.
    Karachi, T.
    Mehta, S.
    Granton, J. T.
    Muscedere, J.
    O'Callaghan, N.
    Dodek, P. M.
    Ashley, B.
    McIntyre, L. A.
    Watpool, I.
    Kutsogiannis, D. J.
    Jacka, M.
    Fowler, R.
    Wood, G.
    Auld, F.
    Cook, D. J.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2014, 189
  • [6] Automated Text line Segmentation and Table detection for Pre-Printed Document Image Analysis Systems
    Rani, N. Shobha
    Pruthvi, T. R.
    Rao, Aishwarya Govinda
    Bipin, Nair B. J.
    ICSPC'21: 2021 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICPSC), 2021, : 723 - 730
  • [7] Impact of a pre-printed physician order form on the quality of care provided to critically ill intubated chronic obstructive pulmonary disease patients
    Perrault, MM
    Tulloch, K
    Cardinal, P
    Clinch, J
    CRITICAL CARE MEDICINE, 2001, 29 (12) : A157 - A157
  • [8] Structural analysis of printed mathematical expressions based on combined strategy
    Ha, Ming-Hu
    Tian, Xue-Dong
    Li, Na
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 3354 - +
  • [9] Performance Analysis of Biased Localization of Heterogeneous Nodes Combined with Pure LEACH Routing Protocol
    Lotfy, Ahmed
    Awamry, Amr A.
    Abdelhalim, M. B.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 681 - 691
  • [10] An automatic generation of pre-processing strategy combined with machine learning multivariate analysis for NIR spectral data
    Arianti, Nunik Destria
    Saputra, Edo
    Sitorus, Agustami
    JOURNAL OF AGRICULTURE AND FOOD RESEARCH, 2023, 13