Deep Learning-Based Text Recognition of Agricultural Regulatory Document

被引:1
|
作者
Leong, Fwa Hua [1 ]
Haur, Chan Farn [2 ]
机构
[1] Singapore Management Univ, 81 Victoria St, Singapore 188065, Singapore
[2] Syngenta Asia Pacific Pte Ltd, 1 Harbourfront Ave,Keppel Bay Tower, Singapore 098632, Singapore
关键词
Deep learning; Text detection; Optical character recognition; Regulatory document;
D O I
10.1007/978-3-031-16210-7_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, an OCR system based on deep learning techniques was deployed to digitize scanned agricultural regulatory documents comprising of certificates and labels. Recognition of the certificates and labels is challenging as they are scanned images of the hard copy form and the layout and size of the text as well as the languages vary between the various countries (due to diverse regulatory requirements). We evaluated and compared between various state-of-the-art deep learning-based text detection and recognition model as well as a packaged OCR library - Tesseract. We then adopted a two-stage approach comprising of text detection using Character Region Awareness For Text (CRAFT) followed by recognition using OCR branch of a multi-lingual text recognition algorithm E2E-MLT. A sliding windows text matcher is used to enhance the extraction of the required information such as trade names, active ingredients and crops. Initial evaluation revealed that the system performs well with a high accuracy of 91.9% for the recognition of trade names in certificates and labels and the system is currently deployed for use in Philippines, one of our collaborator's sites.
引用
收藏
页码:223 / 234
页数:12
相关论文
共 50 条
  • [31] Text recognition in natural scenes based on deep learning
    Yi Jiang
    Zhongyu Jiang
    Liang He
    Shuai Chen
    Multimedia Tools and Applications, 2022, 81 : 10545 - 10559
  • [32] Text recognition in natural scenes based on deep learning
    Jiang, Yi
    Jiang, Zhongyu
    He, Liang
    Chen, Shuai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (08) : 10545 - 10559
  • [33] Ticket Text Detection and Recognition Based on Deep Learning
    Chen, Xiuxin
    Lv, Zhijing
    Zhu, Dongdong
    Yu, Chongchong
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3922 - 3926
  • [34] Sample Balancing for Deep Learning-Based Visual Recognition
    Chen, Xin
    Weng, Jian
    Luo, Weiqi
    Lu, Wei
    Wu, Huimin
    Xu, Jiaming
    Tian, Qi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (10) : 3962 - 3976
  • [35] Deep learning-based image recognition for autonomous driving
    Fujiyoshi, Hironobu
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    IATSS RESEARCH, 2019, 43 (04) : 244 - 252
  • [36] Deep Learning-based Telephony Speech Recognition in the Wild
    Han, Kyu J.
    Hahm, Seongjun
    Kim, Byung-Hak
    Kim, Jungsuk
    Lane, Ian
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1323 - 1327
  • [37] A Review on Deep Learning-based Face Recognition Techniques
    Padma Suresh, L.
    Anil, J.
    2023 Innovations in Power and Advanced Computing Technologies, i-PACT 2023, 2023,
  • [38] Deep Learning-Based Violin Bowing Action Recognition
    Sun, Shih-Wei
    Liu, Bao-Yun
    Chang, Pao-Chi
    SENSORS, 2020, 20 (20) : 1 - 17
  • [39] A Deep Learning-based Unified Solution for Character Recognition
    Das, Avishek
    Rabby, A. K. M. Shahariar Azad
    Kowsar, Ibna
    Rahman, Fuad
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1671 - 1677
  • [40] Deep learning-based garbage image recognition algorithm
    Yuefei Li
    Wei Liu
    Applied Nanoscience, 2023, 13 : 1415 - 1424