Balanced Synthetic Data for Accurate Scene Text Spotting

被引:0
|
作者
Yao, Ying [1 ]
Huang, Zhangjin [2 ]
机构
[1] Univ Sci & Technol China, Sch Software Engn, Hefei 230051, Anhui, Peoples R China
[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
关键词
synthesize and balance; text detection; text recognition; neural networks;
D O I
10.1117/12.2503258
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Previous approaches for scene text detection or recognition have already achieved promising performances across various benchmarks. There are a lot of superior neural network models to choose from to train the desired classifiers. Besides concentrating on designing loss functions and neural network architectures, number and quality of dataset are key to using neural networks. In this paper we propose a new method for synthesizing text in natural scene images that takes into account data balance. For each image we obtain regions normal based on depth and regions information. After choosing a text from text resource, we blend the text in the original image by using the homography matrix of original region contours and mask contours where we put text directly in. Especially, the text source is obtained by a specific loss function which reflects the distances of current characters' distribution and target characters' distribution. Text detection experiments on standard dataset ICDAR2015 and augmented dataset demonstrate that our method of balanced synthetic dataset gets an 84.5% F-score which achieves 2% increase than the result of standard dataset and is also higher than synthetic dataset without balance. Training on balanced synthetic datasets achieves great improvement of text recognition than on some public standard recognition datasets and also performs better than synthetic datasets without balance.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] ACCURATE AND ROBUST SCENE TEXT RECOGNITION VIA ADVERSARIAL TRAINING
    Yang, Xiaomeng
    Yang, Dongbao
    Qiao, Zhi
    Zhou, Yu
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4275 - 4279
  • [42] Accurate Scene Text Recognition Based on Recurrent Neural Network
    Su, Bolan
    Lu, Shijian
    COMPUTER VISION - ACCV 2014, PT I, 2015, 9003 : 35 - 48
  • [43] Accurate Scene Text Detection Via Scale-Aware Data Augmentation and Shape Similarity Constraint
    Dai, Pengwen
    Li, Yang
    Zhang, Hua
    Li, Jingzhi
    Cao, Xiaochun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1883 - 1895
  • [44] Conceptual text region network: Cognition-inspired accurate scene text detection
    Cui, Chenwei
    Lu, Liangfu
    Tan, Zhiyuan
    Hussain, Amir
    NEUROCOMPUTING, 2021, 464 : 252 - 264
  • [45] Accurate and Robust Text Detection: A Step-In for Text Retrieval in Natural Scene Images
    Yin, Xu-Cheng
    Yin, Xuwang
    Huang, Kaizhu
    Hao, Hong-Wei
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 1091 - 1092
  • [46] Text Spotting Transformers
    Zhang, Xiang
    Su, Yongwen
    Tripathi, Subarna
    Tu, Zhuowen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9509 - 9518
  • [47] WACNET:WORD SEGMENTATION GUIDED CHARACTERS AGGREGATION NET FOR SCENE TEXT SPOTTING WITH ARBITRARY SHAPES
    Gao, Yuting
    Huang, Zheng
    Dai, Yuchen
    Chen, Kai
    Guo, Jie
    Qiu, Weidong
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3382 - 3386
  • [48] TTS: Hilbert Transform-Based Generative Adversarial Network for Tattoo and Scene Text Spotting
    Banerjee, Ayan
    Palaiahnakote, Shivakumara
    Pal, Umapada
    Antonacopoulos, Apostolos
    Lu, Tong
    Canet, Josep Llados
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8226 - 8241
  • [49] Revisiting Scene Text Recognition: A Data Perspective
    Jiang, Qing
    Wang, Jiapeng
    Peng, Dezhi
    Liu, Chongyu
    Jin, Lianwen
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20486 - 20497
  • [50] SCENE TEXT SEGMENTATION BY PAIRED DATA SYNTHESIS
    Quang-Vinh Dang
    Lee, Guee-Sang
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 545 - 549