Balanced Synthetic Data for Accurate Scene Text Spotting

Cited by: 0
Authors
Yao, Ying [1]
Huang, Zhangjin [2]
Affiliations
[1] Univ Sci & Technol China, Sch Software Engn, Hefei 230051, Anhui, Peoples R China
[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
Keywords
synthesize and balance; text detection; text recognition; neural networks;
DOI
10.1117/12.2503258
Chinese Library Classification (CLC)
O43 [Optics]
Discipline classification codes
070207; 0803
Abstract
Previous approaches to scene text detection and recognition have achieved promising performance across various benchmarks, and many strong neural network models are available for training the desired classifiers. Beyond the design of loss functions and network architectures, the size and quality of the dataset are key to the effective use of neural networks. In this paper we propose a new method for synthesizing text in natural scene images that takes data balance into account. For each image, we estimate the normal of each region from depth and region information. After choosing a text from the text source, we blend it into the original image using the homography between the original region contour and the mask contour in which the text is directly placed. In particular, the text is selected by means of a loss function that measures the distance between the current character distribution and the target character distribution. Text detection experiments on the standard ICDAR2015 dataset and on the augmented dataset show that training with our balanced synthetic dataset achieves an F-score of 84.5%, a 2% improvement over the result obtained with the standard dataset and also higher than that obtained with a synthetic dataset without balancing. For text recognition, training on balanced synthetic datasets yields a substantial improvement over training on some public standard recognition datasets and also outperforms training on synthetic datasets without balancing.
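The abstract does not specify the loss used to pick text, so the following is only a minimal sketch of the general idea, under stated assumptions: the distance is taken to be the KL divergence from a target character distribution to the running character distribution, and texts are chosen greedily. The names `kl_to_target` and `pick_balanced`, the smoothing constant `eps`, and the greedy strategy are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def kl_to_target(counts, target, eps=1e-9):
    """KL(target || current) over the characters listed in `target`.

    `counts` holds the character counts accumulated so far; `eps` is an
    assumed smoothing constant that keeps unseen characters from
    producing a division by zero.
    """
    total = sum(counts.values())
    kl = 0.0
    for ch, p in target.items():
        if p == 0:
            continue  # zero-probability targets contribute nothing
        q = (counts.get(ch, 0) + eps) / (total + eps * len(target))
        kl += p * math.log(p / q)
    return kl


def pick_balanced(candidates, target, k):
    """Greedily pick up to k texts, each time taking the candidate that
    brings the running character distribution closest to `target`."""
    counts = Counter()
    chosen = []
    pool = list(candidates)
    for _ in range(min(k, len(pool))):
        best = min(
            pool,
            key=lambda w: kl_to_target(
                counts + Counter(c for c in w if c in target), target
            ),
        )
        pool.remove(best)
        chosen.append(best)
        counts.update(c for c in best if c in target)
    return chosen, counts


# Toy example: a uniform target over a three-letter alphabet.
target = {ch: 1 / 3 for ch in "abc"}
candidates = ["aaa", "aab", "bbb", "ccc", "abc"]
chosen, counts = pick_balanced(candidates, target, k=3)
print(chosen, dict(counts))
```

In this toy run the perfectly balanced word is taken first, and later picks compensate for whichever characters are under-represented so far. A real pipeline would draw candidates from a large corpus and would probably use a target distribution tuned to the recognition task.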
Pages: 8