Margin Guidance Network for Arbitrary-shaped Scene Text Detection

被引:0
|
作者
Li, Xin [1 ]
Wu, Xingjiao [1 ]
Ma, Tianlong [1 ]
Zhou, Zhao [2 ]
Chen, Luhui [2 ]
He, Liang [1 ]
机构
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Videt Tech Ltd, Shanghai, Peoples R China
关键词
Scene text detection; Margin Guidance Network; arbitrary-shaped text;
D O I
10.1109/ICTAI50040.2020.00169
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Segmentation-based scene text detection approaches have been adopted to arbitrary-shaped texts and have achieved a great progress. However, false detection always easily exist when the arbitrary-shaped texts are close to each other. In this paper, we propose the Margin Guidance Network (MGN) that mainly based on the margin constraint residual module (MCRM) to address aforementioned problem. The MCRM considers the margins between multiple text instance masks to guide the training of network and improve the performance on text detection. The MCRM contains two prediction branch, the one can generate the multiple different scale of masks for a text instance and the other branch is used to generate multiple margins between the above masks. Experimental results on three public benchmarks including ICDAR2015, CTW1500 and Total-Text have demonstrated that the proposed MGN achieves the state-of-the-art results.
引用
收藏
页码:1111 / 1117
页数:7
相关论文
共 50 条
  • [21] Fourier Contour Embedding for Arbitrary-Shaped Text Detection
    Zhu, Yiqin
    Chen, Jianyong
    Liang, Lingyu
    Kuang, Zhanghui
    Jin, Lianwen
    Zhang, Wayne
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3122 - 3130
  • [22] ReLaText: Exploiting visual relationships for arbitrary-shaped scene text detection with graph convolutional networks
    Ma, Chixiang
    Sun, Lei
    Zhong, Zhuoyao
    Huo, Qiang
    PATTERN RECOGNITION, 2021, 111
  • [23] Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text Detection
    Fu, Zilong
    Xie, Hongtao
    Fang, Shancheng
    Wang, Yuxin
    Xing, Mengting
    Zhang, Yongdong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [24] Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection
    Han, Xu
    Gao, Junyu
    Yang, Chuang
    Yuan, Yuan
    Wang, Qi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 287 - 299
  • [25] Reading Arbitrary-Shaped Scene Text from Images Through Spline Regression and Rectification
    Chen, Long
    Su, Feng
    Shi, Jiahao
    Qian, Ye
    COMPUTER VISION - ACCV 2022, PT V, 2023, 13845 : 107 - 123
  • [26] Which and Where to Focus: A Simple yet Accurate Framework for Arbitrary-Shaped Nearby Text Detection in Scene Images
    Guo, Youhui
    Zhou, Yu
    Qin, Xugong
    Wang, Weiping
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 271 - 283
  • [27] BIP-NET: BIDIRECTIONAL PERSPECTIVE STRATEGY BASED ARBITRARY-SHAPED TEXT DETECTION NETWORK
    Yang, Chuang
    Chen, Mulin
    Yuan, Yuan
    Wang, Qi
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2255 - 2259
  • [28] Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion
    Wang, Qitong
    Fu, Bin
    Li, Ming
    He, Junjun
    Peng, Xi
    Qiao, Yu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4718 - 4729
  • [29] Fast arbitrary shaped scene text detection via text discriminator
    Guizhou Institute of Technology, Guiyzhou, Guiyang, China
    不详
    J. Phys. Conf. Ser., 1742, 1
  • [30] POINTER NETWORKS FOR ARBITRARY-SHAPED TEXT SPOTTING
    Zhang, Yi
    Yang, Wei
    Xu, Zhenbo
    Li, Yingjie
    Chen, Zhi
    Huang, Liusheng
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2375 - 2379