A Robust Algorithm for Text Extraction from Images

被引:0
|
作者
Chidiac, Najwa-Maria [1 ]
Damien, Pascal [1 ]
Yaacoub, Charles [1 ]
机构
[1] Holy Spirit Univ Kaslik USEK, Fac Engn, POB 446, Jounieh, Lebanon
关键词
MSER; OCR; Segmentation; SWD; Text Extraction; SEGMENTATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A robust algorithm that detects text from natural scene images and extracts them regardless of the orientation is proposed. All existing methods are designed to operate under a certain constraint, like detecting text only in one direction. Maximally Stable Extremal Regions (MSER) detector is chosen to extract binary regions since it has proven to be robust to lighting conditions. An enhancement technique for MSER images is designed to obtain clear letter boundaries. Images are then fed into a Stroke Width Detector and several heuristics are applied to remove non-text pixels. Afterwards, detected text regions are fed into an Optical Character Recognition module and then filtered according to their confidence measure. The recognition of characters is not part of the algorithm and the results are only about the detection of text. Our algorithm proved to be effective on blurred images and noisy images as well, based on both subjective and objective evaluations.
引用
收藏
页码:493 / 497
页数:5
相关论文
共 50 条
  • [21] Text information extraction algorithm of video images in multimedia environment
    Mao, Chun
    Hu, Xiao
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2018, 21 (02): : 305 - 310
  • [22] FPGA architecture for text extraction from images
    O. Vignesh
    H. Mangalam
    S. Gayathri
    Cluster Computing, 2019, 22 : 12137 - 12146
  • [23] FPGA architecture for text extraction from images
    Vignesh, O.
    Mangalam, H.
    Gayathri, S.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 5): : 12137 - 12146
  • [24] A Robust and Invariant Keypoint Extraction Algorithm in Brain MR Images
    Sarikhani, Hossein
    Abdollahian, Ebrahim
    Shirpour, Mohsen
    Javaheri, Alireza
    Manzuri, Mohammad Taghi
    ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING, AISP 2013, 2014, 427 : 121 - 130
  • [25] A contour-based robust algorithm for text detection in color images
    Liu, YX
    Goto, S
    Ikenaga, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (03): : 1221 - 1230
  • [26] Preprocessing Techniques for High Quality Text Extraction from Text Images
    Koshy, Alan
    Balakumar, Niranj M. J.
    Shyna, A.
    John, Ansamma
    PROCEEDINGS OF 2019 1ST INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION AND COMMUNICATION TECHNOLOGY (ICIICT 2019), 2019,
  • [27] An efficient ROI detection algorithm for Bangla text extraction and recognition from natural scene images
    Islam, Rashedul
    Islam, Md. Rafiqul
    Talukder, Kamrul Hasan
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 6150 - 6164
  • [28] Extraction Algorithm of English Text Information From Color Images Based on Radial Wavelet Transform
    Wang, Yaqin
    IEEE ACCESS, 2020, 8 (08): : 160050 - 160064
  • [29] Based on Improved Edge Detection Algorithm for English Text Extraction and Restoration From Color Images
    Xu, Jianbo
    Ding, Wenhan
    Zhao, Hanbing
    IEEE SENSORS JOURNAL, 2020, 20 (20) : 11951 - 11958
  • [30] Robust text detection from binarized document images
    Okun, O
    Yan, Y
    Pietikäinen, M
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 61 - 64