Deep Learning-Based Text Recognition of Agricultural Regulatory Document

被引:1
|
作者
Leong, Fwa Hua [1 ]
Haur, Chan Farn [2 ]
机构
[1] Singapore Management Univ, 81 Victoria St, Singapore 188065, Singapore
[2] Syngenta Asia Pacific Pte Ltd, 1 Harbourfront Ave,Keppel Bay Tower, Singapore 098632, Singapore
关键词
Deep learning; Text detection; Optical character recognition; Regulatory document;
D O I
10.1007/978-3-031-16210-7_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, an OCR system based on deep learning techniques was deployed to digitize scanned agricultural regulatory documents comprising of certificates and labels. Recognition of the certificates and labels is challenging as they are scanned images of the hard copy form and the layout and size of the text as well as the languages vary between the various countries (due to diverse regulatory requirements). We evaluated and compared between various state-of-the-art deep learning-based text detection and recognition model as well as a packaged OCR library - Tesseract. We then adopted a two-stage approach comprising of text detection using Character Region Awareness For Text (CRAFT) followed by recognition using OCR branch of a multi-lingual text recognition algorithm E2E-MLT. A sliding windows text matcher is used to enhance the extraction of the required information such as trade names, active ingredients and crops. Initial evaluation revealed that the system performs well with a high accuracy of 91.9% for the recognition of trade names in certificates and labels and the system is currently deployed for use in Philippines, one of our collaborator's sites.
引用
收藏
页码:223 / 234
页数:12
相关论文
共 50 条
  • [21] Deep learning-based microexpression recognition: a survey
    Wenjuan Gong
    Zhihong An
    Noha M. Elfiky
    Neural Computing and Applications, 2022, 34 : 9537 - 9560
  • [22] DEEP LEARNING-BASED HUMAN POSTURE RECOGNITION
    Ayre-Storie, Adam
    Zhang, Li
    PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2021, : 152 - 157
  • [23] Deep Learning-Based Recognition of Underwater Target
    Cao, Xu
    Zhang, Xiaomin
    Yu, Yang
    Niu, Letian
    2016 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2016, : 89 - 93
  • [24] Deep Learning-based Text-in-Image Watermarking
    Karki, Bishwa
    Tsai, Chun-Hua
    Huang, Pei-Chi
    Zhong, Xin
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL, MIPR 2024, 2024, : 376 - 382
  • [25] Deep Learning-Based Algorithm for Classification of News Text
    Yu Li, Xiao
    Han, Ling Bo
    Feng Jiang, Zheng
    IEEE ACCESS, 2024, 12 : 159086 - 159098
  • [26] Deep Learning-based Image Text Processing Research
    Xiong, Huixuan
    Jin, Kai
    Liu, Jingnian
    Cai, Jiahong
    Xiao, Lijun
    2023 IEEE 9TH INTL CONFERENCE ON BIG DATA SECURITY ON CLOUD, BIGDATASECURITY, IEEE INTL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, HPSC AND IEEE INTL CONFERENCE ON INTELLIGENT DATA AND SECURITY, IDS, 2023, : 163 - 168
  • [27] Deep Learning-based Text Classification: A Comprehensive Review
    Minaee, Shervin
    Kalchbrenner, Nal
    Cambria, Erik
    Nikzad, Narjes
    Chenaghlu, Meysam
    Gao, Jianfeng
    ACM COMPUTING SURVEYS, 2022, 54 (03)
  • [28] DeepRegFinder: deep learning-based regulatory elements finder
    Ramakrishnan, Aarthi
    Wangensteen, George
    Kim, Sarah
    Nestler, Eric J.
    Shen, Li
    BIOINFORMATICS ADVANCES, 2024, 4 (01):
  • [29] Learning-Based Word Segmentation for Reliable Text Document Retrieval and Augmentation
    Lomaliza, Jean-Pierre
    Park, Hanhoon
    24TH ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY (VRST 2018), 2018,
  • [30] Deep Learning-Based Scientific Document Summarization Considering Citation
    Divya Jyoti
    Dharmendra Prasad Mahato
    Jyoti Srivastava
    SN Computer Science, 6 (4)