Natural scene text localization and detection using MSER and its variants: a comprehensive survey

被引:1
|
作者
Dutta, Kalpita [1 ]
Sarkhel, Ritesh [2 ]
Kundu, Mahantapas [1 ]
Nasipuri, Mita [1 ]
Das, Nibaran [1 ]
机构
[1] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
[2] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
关键词
Text localization; Text recognition; MSER; Stroke width; Enhanced MSER; Multilevel MSER; MSER survey; NEURAL-NETWORK; IMAGES; RECOGNITION; VIDEO; SEGMENTATION; ALGORITHM; CONTEXT; MODEL;
D O I
10.1007/s11042-023-17671-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text localization and detection within natural scene images have generated significant interest among researchers due to their inherent complexity and various real-life applications. In the last few decades, various methodologies have been developed for localization and detection of wild scene text regions. Among them, Maximally Stable Extremal Regions (MSER) based techniques have achieved remarkable success in a significant variety of text localization tasks over the last decade. MSER is a well-known blob detection method, which has been applied with some modifications in many scene text-related researches. In this paper, we have reviewed and evaluated the concept of MSER methods which are combined with traditional machine learning-based methods using hand-crafted features or deep learning-based methods using automatic feature learning for scene text localization. Different MSER methods, such as standard MSER, MSER with stroke width transform, eMSER, enhanced MSER, multi-level MSER, MSER with CNN features, component splitting with MSER tree, MSER with CNN and CRF, CE-MSER have been described in this study. Finally, we have compared and evaluated the performances of those different types of MSER methods on five publicly available standard scene text datasets, like ICDAR 2003, ICDAR 2013, ICDAR 2015, KAIST, and SVT and provided the insights of appropriate selection of MSER method along with its pros and cons.
引用
收藏
页码:55773 / 55810
页数:38
相关论文
共 50 条
  • [21] Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees
    Huang, Weilin
    Qiao, Yu
    Tang, Xiaoou
    COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 497 - 511
  • [22] Integrated Natural Scene Text Localization and Recognition
    Satwashil, Kakade Snehal
    Pawar, V. R.
    2017 INTERNATIONAL CONFERENCE OF ELECTRONICS, COMMUNICATION AND AEROSPACE TECHNOLOGY (ICECA), VOL 1, 2017, : 371 - 374
  • [23] Text Localization in Natural Images Through Effective Re-Identification of the MSER
    Mahmood, Hanaa F.
    Li, Baihua
    Edirisinghe, Eran
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND MACHINE LEARNING (IML'17), 2017,
  • [24] TEXT DETECTION IN NATURAL SCENE IMAGES BY HIERARCHICAL LOCALIZATION AND GROWING OF TEXTUAL COMPONENTS
    Ding, Wenjun
    Shan, Susu
    Su, Feng
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 775 - 780
  • [25] Scene text detection and recognition: a survey
    Naiemi, Fatemeh
    Ghods, Vahid
    Khalesi, Hassan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (14) : 20255 - 20290
  • [26] Scene text detection and recognition: a survey
    Fatemeh Naiemi
    Vahid Ghods
    Hassan Khalesi
    Multimedia Tools and Applications, 2022, 81 : 20255 - 20290
  • [27] Text Detection using MSER and Stroke Width Transform
    Tabassum, Adiba
    Dhondse, Shweta A.
    2015 FIFTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT2015), 2015, : 568 - 571
  • [28] Scene Text Detection Based on Enhanced Multi-channels MSER and a Fast Text Grouping Process
    Dai, Jin
    Wang, Zu
    Zhao, Xianjing
    Shao, Shuai
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2018, : 351 - 355
  • [29] AdaBoost for Text Detection in Natural Scene
    Lee, Jung-Jin
    Lee, Pyoung-Hean
    Lee, Seong-Whan
    Yuille, Alan
    Koch, Christof
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 429 - 434
  • [30] Natural Scene Text Detection using Deep Neural Networks
    Mayank
    Bhowmick, Swapnamoy
    Kotecha, Disha
    Rege, Priti P.
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,