MOSTL: An Accurate Multi-Oriented Scene Text Localization

Authors
Fatemeh Naiemi
Vahid Ghods
Hassan Khalesi
Affiliations
[1] Semnan Branch, Department of Electronic Engineering
[2] Islamic Azad University, Department of Electronic Engineering
[3] Garmsar Branch
[4] Islamic Azad University
Source
Circuits, Systems, and Signal Processing | 2021 / Vol. 40
Keywords
Scene text localization; Object detection; Multi-oriented; Convolutional neural network; Improved inception layer; Improved ReLU layer; Curved text
Abstract
Automatic text localization in natural environments is a key element of many applications, including self-driving cars, vehicle identification, and providing scene information to visually impaired people. However, text in natural, irregular scenes varies in orientation, shape, and color, which makes it difficult to detect. In this paper, an accurate multi-oriented scene text localization method (MOSTL) based on convolutional neural networks is presented to detect text with high efficiency. The proposed method introduces an improved ReLU layer (i.ReLU) and an improved inception layer (i.inception). First, the proposed structure is used to extract low-level visual features. Then, an extra layer is used to improve the feature extraction. The i.ReLU and i.inception layers improve the information available for text detection. The i.ReLU layers help extract some low-level features appropriately. The i.inception layers (especially the 3 × 3 convolutions) can capture text of widely varying sizes more effectively than a linear chain of convolution layers (without inception layers). The outputs of the i.ReLU and i.inception layers are fed to an extra layer, which enables MOSTL to detect multi-oriented text, including curved and vertical text. Text detection experiments were conducted on well-known datasets, including ICDAR 2019, ICDAR 2017, ICDAR 2015, ICDAR 2003, and MSRA-TD500. MOSTL yielded a remarkable performance improvement.
Pages: 4452-4473
Number of pages: 21
Related papers
50 in total
  • [1] MOSTL: An Accurate Multi-Oriented Scene Text Localization
    Naiemi, Fatemeh
    Ghods, Vahid
    Khalesi, Hassan
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (09) : 4452 - 4473
  • [2] MOST: A Multi-Oriented Scene Text Detector with Localization Refinement
    He, Minghang
    Liao, Minghui
    Yang, Zhibo
    Zhong, Humen
    Tang, Jun
    Cheng, Wenqing
    Yao, Cong
    Wang, Yongpan
    Bai, Xiang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8809 - 8818
  • [3] MULTI-ORIENTED TEXT DETECTION IN SCENE IMAGES
    Basavanna, M.
    Shivakumara, P.
    Srivatsa, S. K.
    Kumar, G. Hemantha
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)
  • [4] Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
    Lyu, Pengyuan
    Yao, Cong
    Wu, Wenhao
    Yan, Shuicheng
    Bai, Xiang
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7553 - 7563
  • [5] Multi-oriented Scene Text Detector with Atrous Convolution
    Pan, Di
    Yu, Fei
    Li, Chunguo
    Yang, Luxi
    2020 INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC), 2020, : 346 - 350
  • [6] Fused Text Segmentation Networks for Multi-oriented Scene Text Detection
    Dai, Yuchen
    Huang, Zheng
    Gao, Yuting
    Xu, Youxuan
    Chen, Kai
    Guo, Jie
    Qiu, Weidong
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3604 - 3609
  • [7] An Intelligent and Robust Multi-Oriented Image Scene Text Detection
    Sayahi, Salem
    Ben Halima, Mohamed
    2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 418 - 422
  • [8] A Character Flow Framework for Multi-Oriented Scene Text Detection
    Yang, Wen-Jun
    Zou, Bei-Ji
    Li, Kai-Wen
    Liu, Shu
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2021, 36 (03) : 465 - 477
  • [9] The Keywords Spotting with Context for Multi-Oriented Chinese Scene Text
    Wu, Dao
    Wang, Rui
    Tian, Xiaowei
    Liang, Dong
    Cao, Xiaochun
    2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [10] Deep Direct Regression for Multi-Oriented Scene Text Detection
    He, Wenhao
    Zhang, Xu-Yao
    Yin, Fei
    Liu, Cheng-Lin
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 745 - 753