Automated hand-marked semantic text recognition from photographs

被引:0
|
作者
Seungah Suh
Ghang Lee
Daeyoung Gil
Yonghan Kim
机构
[1] Yonsei University,Department of Architecture and Architectural Engineering
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Automated text recognition techniques have made significant advancements; however, certain tasks still present challenges. This study is motivated by the need to automatically recognize hand-marked text on construction defect tags among millions of photographs. To address this challenge, we investigated three methods for automating hand-marked semantic text recognition (HMSTR)—a modified scene text recognition-based (STR) approach, a two-step HMSTR approach, and a lumped approach. The STR approach involves locating marked text using an object detection model and recognizing it using a competition-winning STR model. Similarly, the two-step HMSTR approach first localizes the marked text and then recognizes the semantic text using an image classification model. By contrast, the lumped approach performs both localization and identification of marked semantic text in a single step using object detection. Among these approaches, the two-step HMSTR approach achieved the highest F1 score (0.92) for recognizing circled text, followed by the STR approach (0.87) and the lumped approach (0.78). To validate the generalizability of the two-step HMSTR approach, subsequent experiments were conducted using check-marked text, resulting in an F1 score of 0.88. Although the proposed methods have been tested specifically with tags, they can be extended to recognize marked text in reports or books.
引用
收藏
相关论文
共 50 条
  • [1] Automated hand-marked semantic text recognition from photographs
    Suh, Seungah
    Lee, Ghang
    Gil, Daeyoung
    Kim, Yonghan
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [2] Information Extraction from Hand-marked Industrial Inspection Sheets
    Gupta, Gaurav
    Swati
    Sharma, Monika
    Vig, Lovekesh
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 6, 2017, : 33 - 38
  • [3] Improved automated recognition of leopards from photographs
    Mouton, Jacobie
    Van Zijl, Lynette
    Schurch, Matthew P. E.
    AFRICAN JOURNAL OF WILDLIFE RESEARCH, 2020, 50 (01) : 197 - 205
  • [4] Semantic-Emotion Neural Network for Emotion Recognition From Text
    Batbaatar, Erdenebileg
    Li, Meijing
    Ryu, Keun Ho
    IEEE ACCESS, 2019, 7 : 111866 - 111878
  • [5] Automated recognition of forest patterns using aerial photographs
    Barbezat, V
    Kreiss, P
    Sulzmann, A
    Jacot, J
    OPTICS IN AGRICULTURE, FORESTRY, AND BIOLOGICAL PROCESSING II, 1996, 2907 : 30 - 41
  • [6] Moving automated dictation from speech recognition to structured text production
    Trost, Harald
    Jancsary, Jeremy
    Klein, Alexandra
    Matiasek, Johannes
    OGAI Journal (Oesterreichische Gesellschaft fuer Artificial Intelligence), 2009, 28 (01): : 2 - 13
  • [7] COMPUTER RECOGNITION OF HAND-PRINTED TEXT
    MUNSON, JH
    JOURNAL OF TYPOGRAPHIC RESEARCH, 1969, 3 (01): : 31 - +
  • [8] Arabic hand-written text recognition
    Saloum, SS
    ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2001, : 106 - 109
  • [9] Background-Insensitive Scene Text Recognition with Text Semantic Segmentation
    Zhao, Liang
    Wu, Zhenyao
    Wu, Xinyi
    Wilsbacher, Greg
    Wang, Song
    COMPUTER VISION, ECCV 2022, PT XXV, 2022, 13685 : 163 - 182
  • [10] Emotion recognition from text using semantic labels and separable mixture models
    Wu, Chung-Hsien
    Chuang, Ze-Jing
    Lin, Yu-Chung
    ACM Transactions on Asian Language Information Processing, 2006, 5 (02): : 165 - 182