MOSTL: An Accurate Multi-Oriented Scene Text Localization

Cited by: 0
Authors
Fatemeh Naiemi
Vahid Ghods
Hassan Khalesi
Affiliations
[1] Semnan Branch, Department of Electronic Engineering
[2] Islamic Azad University, Department of Electronic Engineering
[3] Garmsar Branch
[4] Islamic Azad University
Source
Circuits, Systems, and Signal Processing | 2021 / Vol. 40
Keywords
Scene text localization; Object detection; Multi-oriented; Convolutional neural network; Improved inception layer; Improved ReLU layer; Curved text;
DOI
Not available
Abstract
Automatic text localization in natural environments is a key element of many applications, including self-driving cars, vehicle identification, and providing scene information to visually impaired people. However, text in natural, irregular scenes varies in orientation, shape, and color, which makes it difficult to detect. In this paper, an accurate multi-oriented scene text localization method (MOSTL) based on convolutional neural networks is presented to detect text efficiently. The proposed method introduces an improved ReLU layer (i.ReLU) and an improved inception layer (i.inception). First, the proposed structure is used to extract low-level visual features. Then, an extra layer is used to improve the feature extraction. The i.ReLU and i.inception layers improve the information available for text detection. The i.ReLU layers help extract some low-level features appropriately. The i.inception layers (especially the 3 × 3 convolutions) can capture text of widely varying sizes more effectively than a linear chain of convolution layers (without inception layers). The outputs of the i.ReLU and i.inception layers are fed to an extra layer, which enables MOSTL to detect multi-oriented text, including curved and vertical text. We conducted text detection experiments on well-known datasets including ICDAR 2019, ICDAR 2017, ICDAR 2015, ICDAR 2003, and MSRA-TD500. MOSTL yielded a remarkable performance improvement.
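The abstract's claim that parallel inception branches handle text of varying sizes better than a single linear chain can be illustrated with a minimal sketch. This is a generic inception-style block, not the paper's i.inception or i.ReLU layers, whose details the abstract does not give. The function names, the kernel sizes (1, 3, 5), and the averaging filters standing in for learned weights are all assumptions made for illustration.

```python
# Illustrative only: a generic inception-style block in pure Python.
# It is NOT the paper's i.inception layer; kernels are fixed averaging
# filters used as hypothetical stand-ins for learned weights.

def conv2d_same(image, kernel):
    """Zero-padded 2-D cross-correlation that keeps the input height and width."""
    h, w = len(image), len(image[0])
    k = len(kernel)  # kernel is assumed square with an odd side length
    pad = k // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            total = 0.0
            for dy in range(k):
                for dx in range(k):
                    yy, xx = y + dy - pad, x + dx - pad
                    if 0 <= yy < h and 0 <= xx < w:
                        total += image[yy][xx] * kernel[dy][dx]
            out[y][x] = total
    return out


def relu(feature_map):
    """Standard ReLU: clamp negative responses to zero."""
    return [[max(0.0, v) for v in row] for row in feature_map]


def box_kernel(size):
    """Hypothetical stand-in for learned weights: a size x size averaging filter."""
    weight = 1.0 / (size * size)
    return [[weight] * size for _ in range(size)]


def inception_block(image, kernel_sizes=(1, 3, 5)):
    """Run parallel branches with different kernel sizes; return one channel per branch."""
    return [relu(conv2d_same(image, box_kernel(s))) for s in kernel_sizes]


# A 5 x 5 image with a single bright pixel in the centre.
image = [[0.0] * 5 for _ in range(5)]
image[2][2] = 9.0
channels = inception_block(image)
```

Each branch keeps the 5 × 5 spatial size, so the outputs can be stacked as channels. At the corner position, only the 5 × 5 branch responds to the centre pixel, which is the sense in which wider branches cover larger text while narrower ones preserve fine detail.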
Pages: 4452-4473
Page count: 21