Detecting Text in the Wild with Deep Character Embedding Network

被引:3
|
作者
Li, Jiaming [1 ]
Zhang, Chengquan [1 ]
Sun, Yipeng [1 ]
Han, Junyu [1 ]
Ding, Errui [1 ]
机构
[1] Baidu Inc, Beijing, Peoples R China
来源
关键词
Text detection; Character detection; Embedding learning;
D O I
10.1007/978-3-030-20870-7_31
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Most text detection methods hypothesize texts are horizontal or multi-oriented and thus define quadrangles as the basic detection unit. However, text in the wild is usually perspectively distorted or curved, which can not be easily tackled by existing approaches. In this paper, we propose a deep character embedding network (CENet) which simultaneously predicts the bounding boxes of characters and their embedding vectors, thus making text detection a simple clustering task in the character embedding space. The proposed method does not require strong assumptions of forming a straight line on general text detection, which provides flexibility on arbitrarily curved or perspectively distorted text. For character detection task, a dense prediction subnetwork is designed to obtain the confidence score and bounding boxes of characters. For character embedding task, a subnet is trained with contrastive loss to project detected characters into embedding space. The two tasks share a backbone CNN from which the multi-scale feature maps are extracted. The final text regions can be easily achieved by a thresholding process on character confidence and embedding distance of character pairs. We evaluated our method on ICDAR13, ICDAR15, MSRA-TD500, and Total Text. The proposed method achieves state-of-the-art or comparable performance on all of the datasets, and shows a substantial improvement in the irregular-text datasets, i.e. Total-Text.
引用
收藏
页码:501 / 517
页数:17
相关论文
共 50 条
  • [21] Detecting Text in Natural Image with Connectionist Text Proposal Network
    Tian, Zhi
    Huang, Weilin
    He, Tong
    He, Pan
    Qiao, Yu
    COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 56 - 72
  • [22] Deep network embedding with dimension selection
    Dong, Tianning
    Sun, Yan
    Liang, Faming
    NEURAL NETWORKS, 2024, 179
  • [23] Deep Partial Multiplex Network Embedding
    Wang, Qifan
    Fang, Yi
    Ravula, Anirudh
    He, Ruining
    Shen, Bin
    Wang, Jingang
    Quan, Xiaojun
    Liu, Dongfang
    COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 1053 - 1062
  • [24] Deep locally linear embedding network
    Wang, Jiaming
    Shao, Zhenfeng
    Huang, Xiao
    Lu, Tao
    Zhang, Ruiqian
    Chen, Xitong
    INFORMATION SCIENCES, 2022, 614 : 416 - 431
  • [25] Network Embedding with Deep Metric Learning
    Cheng, Xiaotao
    Ji, Lixin
    Huang, Ruiyang
    Cui, Ruifei
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (03) : 568 - 578
  • [26] Deep Contrastive Multiview Network Embedding
    Zhang, Mengqi
    Zhu, Yanqiao
    Liu, Qiang
    Wu, Shu
    Wang, Liang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4692 - 4696
  • [27] Deep Neural Architecture with Character Embedding for Semantic Frame Detection
    Daha, Fatima Zohra
    Hewavitharana, Sanjika
    2019 13TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2019, : 302 - 307
  • [28] Scene Text Recognition by Attention Network with Gated Embedding
    Wang, Cong
    Liu, Cheng-Lin
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [29] Multi-attention deep neural network fusing character and word embedding for clinical and biomedical concept extraction
    Fan, Shengyu
    Yu, Hui
    Cai, Xiaoya
    Geng, Yanfang
    Li, Guangzhen
    Xu, Weizhi
    Wang, Xia
    Yang, Yaping
    INFORMATION SCIENCES, 2022, 608 : 778 - 793
  • [30] Semi-supervised network embedding with text information
    Gong, Maoguo
    Yao, Chuanyu
    Xie, Yu
    Xu, Mingliang
    PATTERN RECOGNITION, 2020, 104