MOSTL: An Accurate Multi-Oriented Scene Text Localization

Authors
Fatemeh Naiemi
Vahid Ghods
Hassan Khalesi
Affiliations
[1] Semnan Branch, Department of Electronic Engineering
[2] Islamic Azad University, Department of Electronic Engineering
[3] Garmsar Branch
[4] Islamic Azad University
Source
Circuits, Systems, and Signal Processing | 2021 / Vol. 40
Keywords
Scene text localization; Object detection; Multi-oriented; Convolutional neural network; Improved inception layer; Improved ReLU layer; Curved text
DOI
Not available
Abstract
Automatic text localization in natural environments is a key element of many applications, including self-driving cars, vehicle identification, and providing scene information to visually impaired people. However, text in natural, irregular scenes varies in orientation, shape, and color, which makes it difficult to detect. In this paper, an accurate multi-oriented scene text localization method (MOSTL) based on convolutional neural networks is presented to detect text efficiently. The proposed method introduces an improved ReLU layer (i.ReLU) and an improved inception layer (i.inception). First, the proposed structure is used to extract low-level visual features. Then, an extra layer is used to improve the feature extraction. The i.ReLU and i.inception layers improve the information available for text detection. The i.ReLU layers help extract some low-level features appropriately. The i.inception layers (especially the 3 × 3 convolutions) capture text of widely varying sizes more effectively than a linear chain of convolution layers (without inception layers). The outputs of the i.ReLU and i.inception layers are fed to an extra layer, which enables MOSTL to detect multi-oriented text, including curved and vertical text. Text detection experiments were conducted on well-known datasets including ICDAR 2019, ICDAR 2017, ICDAR 2015, ICDAR 2003, and MSRA-TD500. MOSTL yielded a remarkable performance improvement.
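The abstract's claim that inception-style layers handle text of widely varying sizes better than a linear chain of convolutions can be illustrated with a receptive-field calculation. The sketch below is not the authors' i.inception layer, whose exact design is not given here. It assumes a generic Inception-style block with three hypothetical branches (1 × 1, 1 × 1 followed by 3 × 3, and 1 × 1 followed by two 3 × 3 convolutions), all with stride 1, and compares the receptive fields it exposes with those of a plain chain.

```python
def receptive_field(kernel_sizes, strides=None):
    """Receptive field (in input pixels) of a chain of convolution layers."""
    strides = strides or [1] * len(kernel_sizes)
    rf, jump = 1, 1
    for k, s in zip(kernel_sizes, strides):
        rf += (k - 1) * jump  # each layer widens the field by (k - 1) * current jump
        jump *= s             # the jump grows with every strided layer
    return rf


def block_receptive_fields(branches):
    """Distinct receptive fields available at the output of one multi-branch block."""
    return sorted({receptive_field(branch) for branch in branches})


def stacked_receptive_fields(branches, n_blocks):
    """Distinct receptive fields after stacking n_blocks identical stride-1 blocks."""
    per_block = block_receptive_fields(branches)
    fields = {1}
    for _ in range(n_blocks):
        fields = {f + (b - 1) for f in fields for b in per_block}
    return sorted(fields)


# Hypothetical branch layout (an assumption, not taken from the paper).
BRANCHES = [[1], [1, 3], [1, 3, 3]]
# A linear chain of three 3 x 3 convolutions for comparison.
CHAIN = [3, 3, 3]

if __name__ == "__main__":
    print("linear chain:", receptive_field(CHAIN))
    print("one block:", block_receptive_fields(BRANCHES))
    print("two blocks:", stacked_receptive_fields(BRANCHES, 2))
```

Under these assumptions, the linear chain yields a single receptive field at its output, while the multi-branch block yields several at once, which is one common explanation for why such blocks respond to both small and large text.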
Pages: 4452-4473
Page count: 21