Progressive scene text erasing with self-supervision

Cited by: 4
Authors
Du, Xiangcheng [1]
Zhou, Zhao [1]
Zheng, Yingbin [2]
Wu, Xingjiao [1]
Ma, Tianlong [1]
Jin, Cheng [1]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[2] Videt Lab, Shanghai, Peoples R China
Keywords
Scene text erasing; Progressive strategy; Self-supervision
DOI
10.1016/j.cviu.2023.103712
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Scene text erasing seeks to erase text content from scene images, and current state-of-the-art text erasing models are trained on large-scale synthetic data. Although data synthesis engines can provide vast amounts of annotated training samples, there are differences between synthetic and real-world data. In this paper, we employ self-supervision for feature representation on unlabeled real-world scene text images. A novel pretext task is designed to keep the text stroke masks of image variants consistent. We design the Progressive Erasing Network to remove residual text. The scene text is erased progressively by leveraging intermediate results, which provide the foundation for subsequent higher-quality results. Experiments show that our method significantly improves the generalization of the text erasing task and achieves state-of-the-art performance on public benchmarks.
Pages: 10