Deep Learning-Based Text Recognition of Agricultural Regulatory Document

被引:1
|
作者
Leong, Fwa Hua [1 ]
Haur, Chan Farn [2 ]
机构
[1] Singapore Management Univ, 81 Victoria St, Singapore 188065, Singapore
[2] Syngenta Asia Pacific Pte Ltd, 1 Harbourfront Ave,Keppel Bay Tower, Singapore 098632, Singapore
关键词
Deep learning; Text detection; Optical character recognition; Regulatory document;
D O I
10.1007/978-3-031-16210-7_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, an OCR system based on deep learning techniques was deployed to digitize scanned agricultural regulatory documents comprising of certificates and labels. Recognition of the certificates and labels is challenging as they are scanned images of the hard copy form and the layout and size of the text as well as the languages vary between the various countries (due to diverse regulatory requirements). We evaluated and compared between various state-of-the-art deep learning-based text detection and recognition model as well as a packaged OCR library - Tesseract. We then adopted a two-stage approach comprising of text detection using Character Region Awareness For Text (CRAFT) followed by recognition using OCR branch of a multi-lingual text recognition algorithm E2E-MLT. A sliding windows text matcher is used to enhance the extraction of the required information such as trade names, active ingredients and crops. Initial evaluation revealed that the system performs well with a high accuracy of 91.9% for the recognition of trade names in certificates and labels and the system is currently deployed for use in Philippines, one of our collaborator's sites.
引用
收藏
页码:223 / 234
页数:12
相关论文
共 50 条
  • [1] Deep Learning-Based Image Recognition of Agricultural Pests
    Xu, Weixiao
    Sun, Lin
    Zhen, Cheng
    Liu, Bo
    Yang, Zhengyi
    Yang, Wenke
    APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [2] Deep Learning-Based Document Modeling for Personality Detection from Text
    Majumder, Navonil
    Poria, Soujanya
    Gelbukh, Alexander
    Cambria, Erik
    IEEE INTELLIGENT SYSTEMS, 2017, 32 (02) : 74 - 79
  • [3] Deep learning-based text detection and recognition on architectural floor plans
    Schoenfelder, Phillip
    Stebel, Fynn
    Andreou, Nikos
    Koenig, Markus
    AUTOMATION IN CONSTRUCTION, 2024, 157
  • [4] A Deep Learning-Based Text Detection and Recognition Approach for Natural Scenes
    Li, Xuexiang
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (05)
  • [5] Deep learning-based automatic recognition network of agricultural machinery images
    Zhang, Ziqiang
    Liu, Hui
    Meng, Zhijun
    Chen, Jingping
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2019, 166
  • [6] Benchmarking performance of machine and deep learning-based methodologies for Urdu text document classification
    Asim, Muhammad Nabeel
    Ghani, Muhammad Usman
    Ibrahim, Muhammad Ali
    Mahmood, Waqar
    Dengel, Andreas
    Ahmed, Sheraz
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (11): : 5437 - 5469
  • [7] Benchmarking performance of machine and deep learning-based methodologies for Urdu text document classification
    Muhammad Nabeel Asim
    Muhammad Usman Ghani
    Muhammad Ali Ibrahim
    Waqar Mahmood
    Andreas Dengel
    Sheraz Ahmed
    Neural Computing and Applications, 2021, 33 : 5437 - 5469
  • [8] A Survey of Deep Learning-Based Multimodal Emotion Recognition: Speech, Text, and Face
    Lian, Hailun
    Lu, Cheng
    Li, Sunan
    Zhao, Yan
    Tang, Chuangao
    Zong, Yuan
    ENTROPY, 2023, 25 (10)
  • [9] Deep learning-based recognition system for pashto handwritten text: benchmark on PHTI
    Hussain, Ibrar
    Ahmad, Riaz
    Ullah, Khalil
    Muhammad, Siraj
    Elhassan, Rasha
    Syed, Ikram
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [10] Deep Learning-Based Approaches for Text Recognition in PCB Optical Inspection: A Survey
    Ghosh, Shajib
    Sathiaseelan, Mukhil Azhagan Mallaiyan
    Asadizanjani, Navid
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PHYSICAL ASSURANCE AND INSPECTION ON ELECTRONICS (PAINE), 2021,