Multi-script text versus non-text classification of regions in scene images

被引:11
|
作者
Sriman, Bowornrat [1 ]
Schomaker, Lambert [1 ]
机构
[1] Univ Groningen, Artificial Intelligence, Nijenborgh 9, NL-9747 AG Groningen, Netherlands
关键词
Text detection in scene images; Text/non-text classification; Color features; Color histogram autocorrelation; SCALE; RECOGNITION;
D O I
10.1016/j.jvcir.2019.04.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text versus non-text region classification is an essential but difficult step in scene-image analysis due to the considerable shape complexity of text and background patterns. There exists a high probability of confusion between background elements and letter parts. This paper proposes a feature-based classification of image blocks using the color autocorrelation histogram (CAH) and the scale-invariant feature transform (SIFT) algorithm, yielding a combined scale and color-invariant feature suitable for scene-text classification. For the evaluation, features were extracted from different color spaces, applying color-histogram autocorrelation. The color features are adjoined with a SIFT descriptor. Parameter tuning is performed and evaluated. For the classification, a standard nearest-neighbor (INN) and a support vector machine (SVM) were compared. The proposed method appears to perform robustly and is especially suitable for Asian scripts such as Kannada and Thai, where urban scene-text fonts are characterized by a high curvature and salient color variations. (C) 2019 Published by Elsevier Inc.
引用
收藏
页码:23 / 42
页数:20
相关论文
共 50 条
  • [41] Readability of Non-Text Images on the World Wide Web (WWW)
    Elahi, Ehsan
    Iglesias, Ana
    Morato, Jorge
    IEEE ACCESS, 2022, 10 : 116627 - 116634
  • [42] Residual attention-based multi-scale script identification in scene text images
    Ma, Mengkai
    Wang, Qiu-Feng
    Huang, Shan
    Huang, Shen
    Goulermas, Yannis
    Huang, Kaizhu
    NEUROCOMPUTING, 2021, 421 : 222 - 233
  • [43] Text and Non-text Separation in Handwritten Document Images Using Local Binary Pattern Operator
    Bhowmik, Showmik
    Sarkar, Ram
    Nasipuri, Mita
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND COMMUNICATION, 2017, 458 : 507 - 515
  • [44] Multi-Script-Oriented Text Detection and Recognition in Video/Scene/Born Digital Images
    Raghunandan, K. S.
    Shivakumara, Palaiahnakote
    Roy, Sangheeta
    Kumar, G. Hemantha
    Pal, Umapada
    Lu, Tong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (04) : 1145 - 1162
  • [45] Residual attention-based multi-scale script identification in scene text images
    Ma M.
    Wang Q.-F.
    Huang S.
    Huang S.
    Goulermas Y.
    Huang K.
    Neurocomputing, 2021, 421 : 222 - 233
  • [46] Application of texture-based features for text non-text classification in printed document images with novel feature selection algorithm
    Soulib Ghosh
    S. K. Khalid Hassan
    Ali Hussain Khan
    Ankur Manna
    Showmik Bhowmik
    Ram Sarkar
    Soft Computing, 2022, 26 : 891 - 909
  • [47] A Chinese Document Layout Analysis Based on Non-text Images
    Fu Xiaoling
    Li Xiaofeng
    2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, : 326 - 328
  • [48] Application of texture-based features for text non-text classification in printed document images with novel feature selection algorithm
    Ghosh, Soulib
    Hassan, S. K. Khalid
    Khan, Ali Hussain
    Manna, Ankur
    Bhowmik, Showmik
    Sarkar, Ram
    SOFT COMPUTING, 2022, 26 (02) : 891 - 909
  • [49] Text detection, recognition, and script identification in natural scene images: a Review
    Veronica Naosekpam
    Nilkanta Sahu
    International Journal of Multimedia Information Retrieval, 2022, 11 : 291 - 314
  • [50] Text detection, recognition, and script identification in natural scene images: a Review
    Naosekpam, Veronica
    Sahu, Nilkanta
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 291 - 314