A real-time and effective text detection method for multi-scale and fuzzy text

被引:2
|
作者
Tong, Guoxiang [1 ]
Dong, Ming [1 ]
Song, Yan [1 ]
机构
[1] Univ Shanghai Sci & Technol, Dept Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
关键词
Natural scene text detection; Attention mechanism; Feature path augmentation; CIoU loss; SCENE; ACCURATE;
D O I
10.1007/s11554-023-01267-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The text in the natural scene can be in various forms, dynamic blur and geometric perspective greatly affect the efficiency of text detection. Given the above situation, a real-time and effective text detection method is proposed to detect the multi-scale and fuzzy text. This method applies a convolutional attention mechanism to the feature extraction backbone to obtain more valuable text feature maps. To fully utilize the precise text location signals of the low-level features, a bottom-up path augmentation is used simultaneously. Besides, a few layers of the Resnet-50 backbone are cancelled to further shorten information communication path for balancing the speed and accuracy of detection. For text detection results, the four vertex coordinate values of the text boxes are regressed with the assistance of CIoU loss and shrinkage of text labels. Our model can effectively process an image in the fastest time of 112 ms and has a higher comprehensive indicator value than the other comparative models in ICDAR 2013, ICDAR 2015, and MSRA-TD500 datasets.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] A neuromorphic multi-scale approach for real-time heart rate and state detection
    Chiara De Luca
    Mirco Tincani
    Giacomo Indiveri
    Elisa Donati
    npj Unconventional Computing, 2 (1):
  • [32] Multi-scale Bidirectional Local Template Patterns for Real-time Human Detection
    Xu, Jiu
    Jiang, Ning
    Xue, Xinwei
    Sun, Heming
    Yu, Wenxin
    Goto, Satoshi
    2013 IEEE 15TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2013, : 379 - 383
  • [33] Matching Multi-Scale Features and Prediction Tasks for Real-Time Object Detection
    Du Hongjie
    Sun Hanqing
    Cao Jiale
    Pang Yanwei
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (12)
  • [34] Real-Time Detection of Multi-scale Traffic Signs Based on Decoupled Heads
    Zhang, Yang
    Wu, Chunming
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VIII, ICIC 2024, 2024, 14869 : 241 - 252
  • [35] A Multi-scale Text Line Segmentation Method in Freestyle Handwritten Documents
    Gao, Yangdong
    Ding, Xiaoqing
    Liu, Changsong
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 643 - 647
  • [36] MSER-based Real-Time Text Detection and Tracking
    Gomez, Lluis
    Karatzas, Dimosthenis
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3110 - 3115
  • [37] Real-time Scene Text Detection Based on Stroke Model
    Liu, Yi
    Zhang, Dongming
    Zhang, Yongdong
    Lin, Shouxun
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3116 - 3120
  • [38] An effective method in text detection and characters recognition using fuzzy theory
    Akhbari, Zeinab
    Maleki, Najme
    Zamani, Samaneh
    Yaghmaei, Mohammad H.
    2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 800 - 805
  • [39] Real-time multi-scale parallel compressive tracking
    Chi-Yi Tsai
    Yen-Chang Feng
    Journal of Real-Time Image Processing, 2019, 16 : 2073 - 2091
  • [40] Real-time multi-scale parallel compressive tracking
    Tsai, Chi-Yi
    Feng, Yen-Chang
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2019, 16 (06) : 2073 - 2091