A real-time and effective text detection method for multi-scale and fuzzy text

被引:2
|
作者
Tong, Guoxiang [1 ]
Dong, Ming [1 ]
Song, Yan [1 ]
机构
[1] Univ Shanghai Sci & Technol, Dept Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
关键词
Natural scene text detection; Attention mechanism; Feature path augmentation; CIoU loss; SCENE; ACCURATE;
D O I
10.1007/s11554-023-01267-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The text in the natural scene can be in various forms, dynamic blur and geometric perspective greatly affect the efficiency of text detection. Given the above situation, a real-time and effective text detection method is proposed to detect the multi-scale and fuzzy text. This method applies a convolutional attention mechanism to the feature extraction backbone to obtain more valuable text feature maps. To fully utilize the precise text location signals of the low-level features, a bottom-up path augmentation is used simultaneously. Besides, a few layers of the Resnet-50 backbone are cancelled to further shorten information communication path for balancing the speed and accuracy of detection. For text detection results, the four vertex coordinate values of the text boxes are regressed with the assistance of CIoU loss and shrinkage of text labels. Our model can effectively process an image in the fastest time of 112 ms and has a higher comprehensive indicator value than the other comparative models in ICDAR 2013, ICDAR 2015, and MSRA-TD500 datasets.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] A real-time and effective text detection method for multi-scale and fuzzy text
    Guoxiang Tong
    Ming Dong
    Yan Song
    Journal of Real-Time Image Processing, 2023, 20
  • [2] A Real-time Detection Method for Multi-scale Pedestrians in Complex Environment
    Zhou Weina
    Sun Lihua
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (07) : 2063 - 2070
  • [3] Hierarchical Feature Fusion With Text Attention For Multi-scale Text Detection
    Liu, Chao
    Zou, Yuexian
    Guan, Wenjie
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [4] Text kernel expansion for real-time scene text detection
    He, Tao
    Huang, Sheng
    Tang, Wenhao
    Liu, Bo
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (04)
  • [5] UFNet: A Multi-scale Fusion Feature based Text Detection Method
    Chai, Zhengpeng
    Zhu, Rui
    Wang, Wei
    2023 THE 6TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA 2023, 2023, : 163 - 168
  • [6] Real-Time Action Detection Method based on Multi-Scale Spatiotemporal Feature
    Miao, Xin
    Ke, Xiao
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 245 - 248
  • [7] Multi-scale ResNet for real-time underwater object detection
    Pan, Tien-Szu
    Huang, Huang-Chu
    Lee, Jen-Chun
    Chen, Chung-Hsien
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (05) : 941 - 949
  • [8] Multi-scale ResNet for real-time underwater object detection
    Tien-Szu Pan
    Huang-Chu Huang
    Jen-Chun Lee
    Chung-Hsien Chen
    Signal, Image and Video Processing, 2021, 15 : 941 - 949
  • [9] Real-Time Vehicle Object Detection Method Based on Multi-Scale Feature Fusion
    Guo, Keyou
    Li, Xue
    Zhang, Mo
    Bao, Qichao
    Yang, Min
    IEEE Access, 2021, 9 : 115126 - 115134
  • [10] Real-Time and Efficient Multi-Scale Traffic Sign Detection Method for Driverless Cars
    Wang, Xuan
    Guo, Jian
    Yi, Jinglei
    Song, Yongchao
    Xu, Jindong
    Yan, Weiqing
    Fu, Xin
    SENSORS, 2022, 22 (18)