Detecting Text in the Wild with Deep Character Embedding Network

Cited by: 3
Authors
Li, Jiaming [1]
Zhang, Chengquan [1]
Sun, Yipeng [1]
Han, Junyu [1]
Ding, Errui [1]
Affiliations
[1] Baidu Inc., Beijing, People's Republic of China
Source
Keywords
Text detection; Character detection; Embedding learning
DOI
10.1007/978-3-030-20870-7_31
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Subject classification codes
081202; 0835
Abstract
Most text detection methods assume that text is horizontal or multi-oriented and therefore define quadrangles as the basic detection unit. However, text in the wild is often perspectively distorted or curved, which existing approaches cannot easily handle. In this paper, we propose a deep character embedding network (CENet) that simultaneously predicts the bounding boxes of characters and their embedding vectors, turning text detection into a simple clustering task in the character embedding space. The proposed method does not rely on the strong assumption that text forms a straight line, which gives it flexibility on arbitrarily curved or perspectively distorted text. For the character detection task, a dense prediction subnetwork is designed to obtain the confidence scores and bounding boxes of characters. For the character embedding task, a subnetwork is trained with a contrastive loss to project detected characters into the embedding space. The two tasks share a backbone CNN from which multi-scale feature maps are extracted. The final text regions are obtained by thresholding the character confidence and the embedding distance between character pairs. We evaluated our method on ICDAR13, ICDAR15, MSRA-TD500, and Total-Text. The proposed method achieves state-of-the-art or comparable performance on all of these datasets and shows a substantial improvement on the irregular-text datasets, i.e., Total-Text.
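The grouping step described in the abstract (thresholding character confidence, then linking character pairs whose embeddings are close) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `group_characters`, the use of Euclidean distance, the union-find grouping, and both threshold values are assumptions chosen only for illustration.

```python
import numpy as np


def group_characters(scores, embeddings, score_thresh=0.5, dist_thresh=1.0):
    """Group detected characters into text instances (illustrative sketch).

    scores:       per-character confidence values, shape (N,)
    embeddings:   per-character embedding vectors, shape (N, D)
    score_thresh: hypothetical confidence cutoff (assumed value)
    dist_thresh:  hypothetical embedding-distance cutoff (assumed value)
    Returns a sorted list of groups, each a list of character indices.
    """
    scores = np.asarray(scores, dtype=float)
    embeddings = np.asarray(embeddings, dtype=float)

    # Step 1: discard low-confidence character candidates.
    keep = [int(i) for i in np.flatnonzero(scores >= score_thresh)]

    # Step 2: union-find over the kept characters.
    parent = {i: i for i in keep}

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    # Link every pair whose embedding distance is below the threshold
    # (Euclidean distance is an assumption of this sketch).
    for pos, a in enumerate(keep):
        for b in keep[pos + 1:]:
            if np.linalg.norm(embeddings[a] - embeddings[b]) < dist_thresh:
                parent[find(a)] = find(b)

    # Step 3: collect connected components as text instances.
    groups = {}
    for i in keep:
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())
```

In this sketch, characters that end up in the same connected component form one text instance. No straight-line assumption is needed, because membership depends only on embedding distance and not on box geometry.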
Pages: 501-517
Number of pages: 17