Balanced Synthetic Data for Accurate Scene Text Spotting

被引:0
|
作者
Yao, Ying [1 ]
Huang, Zhangjin [2 ]
机构
[1] Univ Sci & Technol China, Sch Software Engn, Hefei 230051, Anhui, Peoples R China
[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
关键词
synthesize and balance; text detection; text recognition; neural networks;
D O I
10.1117/12.2503258
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Previous approaches for scene text detection or recognition have already achieved promising performances across various benchmarks. There are a lot of superior neural network models to choose from to train the desired classifiers. Besides concentrating on designing loss functions and neural network architectures, number and quality of dataset are key to using neural networks. In this paper we propose a new method for synthesizing text in natural scene images that takes into account data balance. For each image we obtain regions normal based on depth and regions information. After choosing a text from text resource, we blend the text in the original image by using the homography matrix of original region contours and mask contours where we put text directly in. Especially, the text source is obtained by a specific loss function which reflects the distances of current characters' distribution and target characters' distribution. Text detection experiments on standard dataset ICDAR2015 and augmented dataset demonstrate that our method of balanced synthetic dataset gets an 84.5% F-score which achieves 2% increase than the result of standard dataset and is also higher than synthetic dataset without balance. Training on balanced synthetic datasets achieves great improvement of text recognition than on some public standard recognition datasets and also performs better than synthetic datasets without balance.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] ADVMIX: DATA AUGMENTATION FOR ACCURATE SCENE TEXT SPOTTING
    Huang, Yizhang
    Fang, Kun
    Huang, Xiaolin
    Yang, Jie
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 954 - 958
  • [2] TDI TextSpotter: Taking Data Imbalance into Account in Scene Text Spotting
    Zhou, Yu
    Xie, Hongtao
    Fang, Shancheng
    Wang, Jing
    Zha, Zhengjun
    Zhang, Yongdong
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2510 - 2518
  • [3] ICDAR 2021 Competition on Scene Video Text Spotting
    Cheng, Zhanzhan
    Lu, Jing
    Zou, Baorui
    Zhou, Shuigeng
    Wu, Fei
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 650 - 662
  • [4] Scene text spotting based on end-to-end
    Wei G.
    Rong W.
    Liang Y.
    Xiao X.
    Liu X.
    Journal of Intelligent and Fuzzy Systems, 2021, 40 (05): : 8871 - 8881
  • [5] Character Flow Detection and Rectification for Scene Text Spotting
    Zou, Beiji
    Yang, Wenjun
    Li, Kaiwen
    Huang, Enquan
    Liu, Shu
    ADVANCES IN COMPUTER GRAPHICS, CGI 2021, 2021, 13002 : 288 - 299
  • [6] A survey on methods, datasets and implementations for scene text spotting
    Blanco-Medina, Pablo
    Fidalgo, Eduardo
    Alegre, Enrique
    Gonzalez-Castro, Victor
    IET IMAGE PROCESSING, 2022, 16 (13) : 3426 - 3445
  • [7] Synthetic Data Generation for Text Spotting on Printed Circuit Board Component Images
    Liau, Wei Jie Brigitte
    Tay, Shiek Chi
    Mohamed, Ahmad Sufril Azlan
    Ab Wahab, Mohd Nadhir
    Lim, Lay Chuan
    Khaw, Beng Kang
    Noor, Mohd Halim Mohd
    IEEE ACCESS, 2024, 12 : 61235 - 61251
  • [8] Compact and Accurate Scene Text Detector
    Jeon, Minjun
    Jeong, Young-Seob
    APPLIED SCIENCES-BASEL, 2020, 10 (06):
  • [9] CommuSpotter: Scene Text Spotting with Multi-Task Communication
    Zhao, Liang
    Wilsbacher, Greg
    Wang, Song
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [10] SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition
    Huang, Mingxin
    Liu, Yuliang
    Peng, Zhenghao
    Liu, Chongyu
    Lin, Dahua
    Zhu, Shenggao
    Yuan, Nicholas
    Ding, Kai
    Jin, Lianwen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4583 - 4593