Progressive scene text erasing with self-supervision

被引：4

作者：

Du, Xiangcheng ^{[1
]}

Zhou, Zhao ^{[1
]}

Zheng, Yingbin ^{[2
]}

Wu, Xingjiao ^{[1
]}

Ma, Tianlong ^{[1
]}

Jin, Cheng ^{[1
]}

机构：

[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China

[2] Videt Lab, Shanghai, Peoples R China

来源：

COMPUTER VISION AND IMAGE UNDERSTANDING | 2023年 / 233卷

关键词：

Scene text erasing; Progressive strategy; Self-supervision;

D O I：

10.1016/j.cviu.2023.103712

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Scene text erasing seeks to erase text contents from scene images and current state-of-the-art text erasing models are trained on large-scale synthetic data. Although data synthetic engines can provide vast amounts of annotated training samples, there are differences between synthetic and real-world data. In this paper, we employ self-supervision for feature representation on unlabeled real-world scene text images. A novel pretext task is designed to keep consistent among text stroke masks of image variants. We design the Progressive Erasing Network in order to remove residual texts. The scene text is erased progressively by leveraging the intermediate generated results which provide the foundation for subsequent higher quality results. Experiments show that our method significantly improves the generalization of the text erasing task and achieves state-of-the-art performance on public benchmarks.

引用

页数：10

共 50 条

[21] Disentangled Self-Supervision in Sequential Recommenders
Ma, Jianxin
Zhou, Chang
Yang, Hongxia
Cui, Peng
Wang, Xin
Zhu, Wenwu
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 483 - 491
[22] Hyperspherically regularized networks for self-supervision
Durrant, Aiden
Leontidis, Georgios
IMAGE AND VISION COMPUTING, 2022, 124
[23] PITCH ESTIMATION VIA SELF-SUPERVISION
Gfeller, Beat
Frank, Christian
Roblek, Dominik
Sharifi, Matt
Tagliasacchi, Marco
Velimirovic, Mihajlo
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3527 - 3531
[24] TRASS: Time Reversal as Self-Supervision
Nair, Suraj
Babaeizadeh, Mohammad
Finn, Chelsea
Levine, Sergey
Kumar, Vikash
2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 115 - 121
[25] Hyperspherically regularized networks for self-supervision
Durrant, Aiden
Leontidis, Georgios
Image and Vision Computing, 2022, 124
[26] Scene text removal via cascaded text stroke detection and erasing
Xuewei Bian
Chaoqun Wang
Weize Quan
Juntao Ye
Xiaopeng Zhang
Dong-Ming Yan
Computational Visual Media, 2022, 8 (02) : 273 - 287
[27] Scene text removal via cascaded text stroke detection and erasing
Bian, Xuewei
Wang, Chaoqun
Quan, Weize
Ye, Juntao
Zhang, Xiaopeng
Yan, Dong-Ming
COMPUTATIONAL VISUAL MEDIA, 2022, 8 (02) : 273 - 287
[28] Scene text removal via cascaded text stroke detection and erasing
Xuewei Bian
Chaoqun Wang
Weize Quan
Juntao Ye
Xiaopeng Zhang
Dong-Ming Yan
Computational Visual Media, 2022, 8 : 273 - 287
[29] The IRMA dream, self-analysis, and self-supervision
Blum, H
JOURNAL OF THE AMERICAN PSYCHOANALYTIC ASSOCIATION, 1996, 44 (02) : 511 - 532
[30] Video-based spatio-temporal scene graph generation with efficient self-supervision tasks
Lianggangxu Chen
Yiqing Cai
Changhong Lu
Changbo Wang
Gaoqi He
Multimedia Tools and Applications, 2023, 82 : 38947 - 38966

← 1 2 3 4 5 →