MOSTL: An Accurate Multi-Oriented Scene Text Localization

被引:0
|
作者
Fatemeh Naiemi
Vahid Ghods
Hassan Khalesi
机构
[1] Semnan Branch,Department of Electronic Engineering
[2] Islamic Azad University,Department of Electronic Engineering
[3] Garmsar Branch,undefined
[4] Islamic Azad University,undefined
来源
Circuits, Systems, and Signal Processing | 2021年 / 40卷
关键词
Scene text localization; Object detection; Multi-oriented; Convolutional neural network; Improved inception layer; Improved ReLU layer; Curved text;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic text localization in natural environments is the main element of many applications including self-driving cars, identifying vehicles, and providing scene information to visually impaired people. However, text in the natural and irregular scene has different degrees in orientations, shapes, and colors that make it difficult to detect. In this paper, an accurate multi-oriented scene text localization (MOSTL) is presented to obtain high efficiency of detecting text-based on convolutional neural networks. In the proposed method, an improved ReLU layer (i.ReLU) and an improved inception layer (i.inception) were introduced. Firstly, the proposed structure is used to extract low-level visual features. Then, an extra layer has been used to improve the feature extraction. The i.ReLU and i.inception layers have improved valuable information in text detection. The i.ReLU layers cause to extract some low-level features appropriately. The i.inception layers (specially 3 × 3 convolutions) can obtain broadly varying-sized text more effectively than a linear chain of convolution layer (without inception layers). The output of i.ReLU layers and i.inception layers was fed to an extra layer, which enables MOSTL to detect multi-oriented even curved and vertical texts. We conducted text detection experiments on well-known databases including ICDAR 2019, ICDAR 2017, ICDAR 2015, ICDAR 2003, and MSRA-TD500. MOSTL results yielded performance improvement remarkably.
引用
收藏
页码:4452 / 4473
页数:21
相关论文
共 50 条
  • [31] A new method for multi-oriented graphics-scene-3D text classification in video
    Xu, Jiamin
    Shivakumara, Palaiahnakote
    Lu, Tong
    Tan, Chew Lim
    Uchida, Seiichi
    PATTERN RECOGNITION, 2016, 49 : 19 - 42
  • [32] Multi-Oriented Text Extraction in Stylistic Documents
    Singh, Brij Mohan
    Sharma, Rahul
    Ghosh, Debashis
    Mittal, Ankush
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2015, 15 (01)
  • [33] Multi-oriented scene text detection in video based on wavelet and angle projection boundary growing
    Shivakumara, Palaiahnakote
    Dutta, Anjan
    Tan, Chew Lim
    Pal, Umapada
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 72 (01) : 515 - 539
  • [34] IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection
    Yang, Qiangpeng
    Cheng, Mengli
    Zhou, Wenmeng
    Chen, Yan
    Qiu, Minghui
    Lin, Wei
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1071 - 1077
  • [35] Multi-oriented scene text detection in video based on wavelet and angle projection boundary growing
    Palaiahnakote Shivakumara
    Anjan Dutta
    Chew Lim Tan
    Umapada Pal
    Multimedia Tools and Applications, 2014, 72 : 515 - 539
  • [36] Recognition of Multi-Oriented, Multi-Sized, and Curved Text
    Chiang, Yao-Yi
    Knoblock, Craig A.
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1399 - 1403
  • [37] A Laplacian Approach to Multi-Oriented Text Detection in Video
    Shivakumara, Palaiahnakote
    Phan, Trung Quy
    Tan, Chew Lim
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (02) : 412 - 419
  • [38] MASK-MOST NET: MASK APPROXIMATION BASED MULTI-ORIENTED SCENE TEXT DETECTION NETWORK
    Guo, Xiaobao
    Li, Jinxing
    Chen, Bingzhi
    Lu, Guangming
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 206 - 211
  • [39] Multi-Oriented Text Detection with Fully Convolutional Networks
    Zhang, Zheng
    Zhang, Chengquan
    Shen, Wei
    Yao, Cong
    Liu, Wenyu
    Bai, Xiang
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4159 - 4167
  • [40] Convolutional Regression Network for Multi-Oriented Text Detection
    Gao, Junyu
    Wang, Qi
    Yuan, Yuan
    IEEE ACCESS, 2019, 7 : 96424 - 96433