Automated hand-marked semantic text recognition from photographs

Cited by: 0
Authors
Seungah Suh
Ghang Lee
Daeyoung Gil
Yonghan Kim
Affiliation
[1] Yonsei University, Department of Architecture and Architectural Engineering
DOI: not available
Abstract
Automated text recognition techniques have advanced significantly, but certain tasks remain challenging. This study is motivated by the need to automatically recognize hand-marked text on construction defect tags among millions of photographs. To address this challenge, we investigated three methods for automating hand-marked semantic text recognition (HMSTR): a modified scene text recognition (STR)-based approach, a two-step HMSTR approach, and a lumped approach. The STR approach locates marked text using an object detection model and recognizes it using a competition-winning STR model. Similarly, the two-step HMSTR approach first localizes the marked text and then recognizes the semantic text using an image classification model. By contrast, the lumped approach performs both localization and identification of marked semantic text in a single step using object detection. Among these approaches, the two-step HMSTR approach achieved the highest F1 score (0.92) for recognizing circled text, followed by the STR approach (0.87) and the lumped approach (0.78). To validate the generalizability of the two-step HMSTR approach, subsequent experiments were conducted using check-marked text, resulting in an F1 score of 0.88. Although the proposed methods have been tested specifically with tags, they can be extended to recognize marked text in reports or books.
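The two-step flow described in the abstract (localize the marked text, then classify each cropped region) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the `localize` and `classify` callables stand in for the object detection and image classification models, and the names `Recognition`, `crop`, `two_step_hmstr`, and `f1_from_counts` are all hypothetical. The F1 helper uses the standard definition from true-positive, false-positive, and false-negative counts; the abstract does not say how the reported scores were aggregated.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Assumption: a box is (x_min, y_min, x_max, y_max) in pixel coordinates,
# and an image is a toy grayscale grid given as rows of pixel values.
Box = Tuple[int, int, int, int]
Image = List[List[int]]


@dataclass
class Recognition:
    """One recognized marked-text region (hypothetical container)."""
    box: Box
    label: str


def crop(image: Image, box: Box) -> Image:
    """Cut the region covered by `box` out of the image."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]


def two_step_hmstr(
    image: Image,
    localize: Callable[[Image], List[Box]],
    classify: Callable[[Image], str],
) -> List[Recognition]:
    """Step 1: find marked regions. Step 2: classify each cropped region."""
    return [
        Recognition(box, classify(crop(image, box)))
        for box in localize(image)
    ]


def f1_from_counts(tp: int, fp: int, fn: int) -> float:
    """Standard F1 score, 2*TP / (2*TP + FP + FN); 0.0 when undefined."""
    denominator = 2 * tp + fp + fn
    return 2 * tp / denominator if denominator else 0.0
```

In use, `localize` would wrap a trained detector and `classify` a trained image classifier; keeping them as plain callables lets either stage be swapped out without touching the pipeline.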