Robust Localization of Texts in Real-World Images

被引:6
|
作者
Ghanei, Shaho [1 ]
Faez, Karim [1 ]
机构
[1] Amirkabir Univ Technol, Dept Elect Engn, Tehran 15914, Iran
关键词
Scene text localization; closed boundary; natural images; luminance contrast; content-based image retrieval; visual impairment assistance system; HYBRID APPROACH; SCENE; ALGORITHM; STEREO;
D O I
10.1142/S0218001415550125
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Localization of texts in natural images could be an important stage in many applications such as content-based image retrieval, visual impairment assistance systems, automatic robot navigation in urban environments and tourist assistance systems. However due to the variations of font, script, scale, orientations, color, shadow and lighting conditions, robust scene text localization is still a challenging task. In this paper, we propose a novel method to localize not only Farsi/Arabic and Latin texts with different sizes, fonts and orientations but also low luminance contrast and poor quality ones in the natural images taken with uneven illumination conditions. Firstly, fast weighted median filtering as a nonlinear edge-preserving smoothing filter and then color contrast preserving decolorization are exploited to make the text localization system more robust for low luminance contrast and poor quality texts. In order to extract the Farsi/Arabic and Latin scene texts and also filter the nontext ones, a unified framework is proposed incorporating the maximally stable extremal regions and a novel proposed region detector called Stable Width Stroke Regions which is based on closed boundary regions. Phase congruency and Laplacian operators are exploited to extract the closed boundary regions. Finally, to extract the single text lines, the Meanshift clustering and radon transform were used. Experimental results show that the proposed method localize low luminance contrast and low quality scene texts for both Farsi/Arabic and Latin scripts encouragingly.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Robust speaker localization for real-world robots
    Athanasopoulos, Georgios
    Verhelst, Werner
    Sahli, Hichem
    COMPUTER SPEECH AND LANGUAGE, 2015, 34 (01): : 129 - 153
  • [2] Co-localization in Real-World Images
    Tang, Kevin
    Joulin, Armand
    Li, Li-Jia
    Li Fei-Fei
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1464 - 1471
  • [3] Robust skin detection in real-world images
    Huang, Lei
    Ji, Wen
    Wei, Zhiqiang
    Chen, Bo-Wei
    Yan, Chenggang Clarence
    Nie, Jie
    Yin, Jian
    Jiang, Baochen
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 29 : 147 - 152
  • [4] A Method for Text Localization and Recognition in Real-World Images
    Neumann, Lukas
    Matas, Jiri
    COMPUTER VISION - ACCV 2010, PT III, 2011, 6494 : 770 - 783
  • [5] LOGO RECOGNITION AND LOCALIZATION IN REAL-WORLD IMAGES BY USING VISUAL PATTERNS
    Chu, Wei-Ta
    Lin, Tsung-Che
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 973 - 976
  • [6] HIDING IMAGES INTO IMAGES WITH REAL-WORLD ROBUSTNESS
    Ying, Qichao
    Zhou, Hang
    Zeng, Xianhan
    Xu, Haisheng
    Qian, Zhenxing
    Zhang, Xinpeng
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 111 - 115
  • [7] Automatic Localization of Lung Opacity in Chest CT Images - A Real-World Study
    Xie, Yiting
    Rajan, Deepta
    Schudlo, Larissa
    Takeuchi, Yusuke
    Graf, Benedikt
    Coy, Adam
    Negahdar, Mohammadreza
    Mukherjee, Vandana
    Beymer, David
    Krishnan, Arun
    MEDICAL IMAGING 2021: COMPUTER-AIDED DIAGNOSIS, 2021, 11597
  • [8] Text Localization in Real-world Images using Efficiently Pruned Exhaustive Search
    Neumann, Lukas
    Matas, Jiri
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 687 - 691
  • [9] Attention based CNN model for fire detection and localization in real-world images
    Majid, Saima
    Alenezi, Fayadh
    Masood, Sarfaraz
    Ahmad, Musheer
    Gunduz, Emine Selda
    Polat, Kemal
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 189
  • [10] Mathematical modeling of real-world images
    Sendov, B
    CONSTRUCTIVE APPROXIMATION, 1996, 12 (01) : 31 - 65