Detecting Text in the Wild with Deep Character Embedding Network

被引:3
|
作者
Li, Jiaming [1 ]
Zhang, Chengquan [1 ]
Sun, Yipeng [1 ]
Han, Junyu [1 ]
Ding, Errui [1 ]
机构
[1] Baidu Inc, Beijing, Peoples R China
来源
关键词
Text detection; Character detection; Embedding learning;
D O I
10.1007/978-3-030-20870-7_31
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Most text detection methods hypothesize texts are horizontal or multi-oriented and thus define quadrangles as the basic detection unit. However, text in the wild is usually perspectively distorted or curved, which can not be easily tackled by existing approaches. In this paper, we propose a deep character embedding network (CENet) which simultaneously predicts the bounding boxes of characters and their embedding vectors, thus making text detection a simple clustering task in the character embedding space. The proposed method does not require strong assumptions of forming a straight line on general text detection, which provides flexibility on arbitrarily curved or perspectively distorted text. For character detection task, a dense prediction subnetwork is designed to obtain the confidence score and bounding boxes of characters. For character embedding task, a subnet is trained with contrastive loss to project detected characters into embedding space. The two tasks share a backbone CNN from which the multi-scale feature maps are extracted. The final text regions can be easily achieved by a thresholding process on character confidence and embedding distance of character pairs. We evaluated our method on ICDAR13, ICDAR15, MSRA-TD500, and Total Text. The proposed method achieves state-of-the-art or comparable performance on all of the datasets, and shows a substantial improvement in the irregular-text datasets, i.e. Total-Text.
引用
收藏
页码:501 / 517
页数:17
相关论文
共 50 条
  • [1] Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis
    Arora, Monika
    Kansal, Vineet
    SOCIAL NETWORK ANALYSIS AND MINING, 2019, 9 (01)
  • [2] Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis
    Monika Arora
    Vineet Kansal
    Social Network Analysis and Mining, 2019, 9
  • [3] Detecting and Removing Text in the Wild
    Cho, Junho
    Yun, Sangdoo
    Han, Dongyoon
    Heo, Byeongho
    Choi, Jin Young
    IEEE ACCESS, 2021, 9 : 123313 - 123323
  • [4] Detecting Tampered Scene Text in the Wild
    Wang, Yuxin
    Xie, Hongtao
    Xing, Mengting
    Wang, Jing
    Zhu, Shenggao
    Zhang, Yongdong
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 215 - 232
  • [5] Text Position-Aware Pixel Aggregation Network With Adaptive Gaussian Threshold: Detecting Text in the Wild
    Xu, Jiayu
    Lin, Ailiang
    Li, Jinxing
    Lu, Guangming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 286 - 298
  • [6] ScrutNet: a deep ensemble network for detecting fake news in online text
    Verma, Aryan
    Priyanka, P.
    Khan, Tayyab
    Singh, Karan
    Yesufu, Lawal . O.
    Ariffin, Mazeyanti Mohd
    Ahmadian, Ali
    SOCIAL NETWORK ANALYSIS AND MINING, 2025, 15 (01)
  • [7] Enhanced Network Embedding with Text Information
    Yang, Shuang
    Yang, Bo
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 326 - 331
  • [8] Dynamically Jointing character and word embedding for Chinese text Classification
    Tang, Xuetao
    Hu, Xuegang
    Li, Peipei
    11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 336 - 343
  • [9] An Improved Deep Learning Network Structure for Multitask Text Implication Translation Character Recognition
    Ma, Xiaoli
    Xu, Hongyan
    Zhang, Xiaoqian
    Wang, Haoyong
    COMPLEXITY, 2021, 2021
  • [10] Deep Kernel Network Embedding
    Zhang, Bo
    Zhang, Xiaoming
    Huang, Feiran
    Lu, Ming
    Ma, Shuai
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 5710 - 5723