Fine-grained Pseudo Labels for Scene Text Recognition

被引:0
|
作者
Li, Xiaoyu [1 ]
Chen, Xiaoxue [1 ]
Huang, Zuming [1 ]
Xie, Lele [1 ]
Chen, Jingdong [1 ]
Yang, Ming [1 ]
机构
[1] Ant Grp, Hangzhou, Peoples R China
关键词
pseudo labels; domain shift; scene text recognition;
D O I
10.1145/3581783.3611791
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pseudo-Labeling based semi-supervised learning has shown promising advantages in Scene Text Recognition (STR). Most of them usually use a pre-trained model to generate sequence-level pseudo labels for text images and then re-train the model. Recently, conducting Pseudo-Labeling in a teacher-student framework (a student model is supervised by the pseudo labels from a teacher model) has become increasingly popular, which trains in an end-to-end manner and yields outstanding performance in semi-supervised learning. However, applying this framework directly to Pseudo-Labeling STR exhibits unstable convergence, as generating pseudo labels at the coarse-grained sequence-level leads to inefficient utilization of unlabelled data. Furthermore, the inherent domain shift between labeled and unlabeled data results in low quality of derived pseudo labels. To mitigate the above issues, we propose a novel Cross-domain Pseudo-Labeling (CPL) approach for scene text recognition, which makes better utilization of unlabeled data at the character-level and provides more accurate pseudo labels. Specifically, our proposed Pseudo-Labeled Curriculum Learning dynamically adjusts the thresholds for different character classes according to the model's learning status. Moreover, an Adaptive Distribution Regularizer is employed to bridge the domain gap and improve the quality of pseudo labels. Extensive experiments show that CPL boosts those representative STR models to achieve state-of-the-art results on six challenging STR benchmarks. Besides, it can be effectively generalized to handwritten text.
引用
收藏
页码:5786 / 5795
页数:10
相关论文
共 50 条
  • [1] Knowledge Mining with Scene Text for Fine-Grained Recognition
    Wang, Hao
    Liao, Junchao
    Cheng, Tianheng
    Gao, Zewen
    Liu, Hao
    Ren, Bo
    Bai, Xiang
    Liu, Wenyu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4614 - 4623
  • [2] Fine-Grained Language Identification in Scene Text Images
    Li, Yongrui
    Wu, Shilian
    Yu, Jun
    Wang, Zengfu
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4573 - 4581
  • [3] A fine-grained approach to scene text script identification
    Gomez, Lluis
    Karatzas, Dimosthenis
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 192 - 197
  • [4] Semantic Clustering for Robust Fine-Grained Scene Recognition
    George, Marian
    Dixit, Mandar
    Zogg, Gabor
    Vasconcelos, Nuno
    COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 783 - 798
  • [5] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587
  • [6] Leveraging Fine-Grained Labels to Regularize Fine-Grained Visual Classification
    Wu, Junfeng
    Yao, Li
    Liu, Bin
    Ding, Zheyuan
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION (ICCMS 2019) AND 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND APPLICATIONS (ICICA 2019), 2019, : 133 - 136
  • [7] TextBlock: Towards Scene Text Spotting without Fine-grained Detection
    Jin Wei
    Zhang, Yuan
    Zhou, Yu
    Zeng, Gangyan
    Qiao, Zhi
    Guo, Youhui
    Wu, Haiying
    Wang, Hongbin
    Wang, Weiping
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5892 - 5902
  • [8] Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification
    Bai, Xiang
    Yang, Mingkun
    Lyu, Pengyuan
    Xu, Yongchao
    Luo, Jiebo
    IEEE ACCESS, 2018, 6 : 66322 - 66335
  • [9] Scene Uyghur Text Detection Based on Fine-Grained Feature Representation
    Wang, Yiwen
    Mamat, Hornisa
    Xu, Xuebin
    Aysa, Alimjan
    Ubul, Kurban
    SENSORS, 2022, 22 (12)
  • [10] Fine-Grained Classification with Noisy Labels
    Wei, Qi
    Feng, Lei
    Sun, Haoliang
    Wang, Ren
    Guo, Chenhui
    Yin, Yilong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11651 - 11660