Natural scene text localization and detection using MSER and its variants: a comprehensive survey

被引:1
|
作者
Dutta, Kalpita [1 ]
Sarkhel, Ritesh [2 ]
Kundu, Mahantapas [1 ]
Nasipuri, Mita [1 ]
Das, Nibaran [1 ]
机构
[1] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
[2] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
关键词
Text localization; Text recognition; MSER; Stroke width; Enhanced MSER; Multilevel MSER; MSER survey; NEURAL-NETWORK; IMAGES; RECOGNITION; VIDEO; SEGMENTATION; ALGORITHM; CONTEXT; MODEL;
D O I
10.1007/s11042-023-17671-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text localization and detection within natural scene images have generated significant interest among researchers due to their inherent complexity and various real-life applications. In the last few decades, various methodologies have been developed for localization and detection of wild scene text regions. Among them, Maximally Stable Extremal Regions (MSER) based techniques have achieved remarkable success in a significant variety of text localization tasks over the last decade. MSER is a well-known blob detection method, which has been applied with some modifications in many scene text-related researches. In this paper, we have reviewed and evaluated the concept of MSER methods which are combined with traditional machine learning-based methods using hand-crafted features or deep learning-based methods using automatic feature learning for scene text localization. Different MSER methods, such as standard MSER, MSER with stroke width transform, eMSER, enhanced MSER, multi-level MSER, MSER with CNN features, component splitting with MSER tree, MSER with CNN and CRF, CE-MSER have been described in this study. Finally, we have compared and evaluated the performances of those different types of MSER methods on five publicly available standard scene text datasets, like ICDAR 2003, ICDAR 2013, ICDAR 2015, KAIST, and SVT and provided the insights of appropriate selection of MSER method along with its pros and cons.
引用
收藏
页码:55773 / 55810
页数:38
相关论文
共 50 条
  • [31] Scene Text Localization Using Keypoints
    Erdogmus, Nesli
    Ozuysal, Mustafa
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 1917 - 1920
  • [32] A Novel Study on Localization in Scene Text Detection
    Sonsare, Pravinkumar
    Jain, Rushabh
    Runwal, Rutuj
    Dave, Kunal
    Banode, Ashutosh
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 8 - 15
  • [33] An Enhanced MSER Pruning Algorithm for Detection and Localization of Bangla Texts from Scene Images
    Islam, Rashedul
    Islam, Rafiqul
    Talukder, Kamrul
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (03) : 375 - 385
  • [34] Text Detection Model for Historical Documents Using CNN and MSER
    Li, Rankang
    Chen, Shanxiong
    Zhao, Fujia
    Qiu, Xiaogang
    JOURNAL OF DATABASE MANAGEMENT, 2023, 34 (01)
  • [35] Variation of Stability Factor of MSERs for Text Detection and Localization in Natural Scene Image Using Naive Bayes Classifier
    Soni, Rituraj
    Kumar, Bijendra
    Chand, Satish
    INFORMATION, COMMUNICATION AND COMPUTING TECHNOLOGY, ICICCT 2018, 2019, 835 : 192 - 206
  • [36] Thai Text Localization in Natural Scene Images using Convolutional Neural Network
    Kobchaisawat, Thananop
    Chalidabhongse, Thanarat H.
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [37] An Improved Text Localization Method for Natural Scene Images
    Jiang Mengdi
    Cheng Jianghua
    Chen Minghui
    Ku Xishu
    2017 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, IMAGE AND SIGNAL PROCESSING (CCISP 2017), 2018, 960
  • [38] SCENE TEXT DETECTION BASED ON PRUNING STRATEGY OF MSER-TREES AND LINKAGE-TREES
    Ma, Jin
    Wang, Weiqiang
    Lu, Ke
    Zhou, Jianshe
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 367 - 372
  • [39] Text Detection in Natural Scene Images Using Two Masks Filtering
    Turki, Houssem
    Ben Halima, Mohamed
    Alimi, Adel M.
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [40] Bilingual text detection in natural scene images using invariant moments
    Maheshwari, Karan
    Raj, Alex Noel Joseph
    Mahesh, Vijayalakshmi G. V.
    Zhuang, Zhemin
    Rufus, Elizabeth
    Shivakumara, Palaiahnakote
    Naik, Ganesh R.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (05) : 6773 - 6784