Automatic text extraction in news images using morphology

被引:1
|
作者
Jang, IY [1 ]
Ko, BC [1 ]
Byun, H [1 ]
Choi, YW [1 ]
机构
[1] Yonsei Univ, Dept Comp Sci, Visual Informat Proc Lab, Seodaemun Gu, Seoul 120749, South Korea
关键词
text extraction; video indexing; morphology;
D O I
10.1117/12.453094
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we present a new method to extract both superimposed and embedded graphical texts in a freeze-frame of news video. The algorithm is summarized in the following three steps. For the first step, we convert a color image into a gray-level image and apply contrast stretching to enhance the contrast of the input image. Then, a modified local adaptive thresholding is applied to the contrast-stretched image. The second step is divided into three processes: eliminating text-like components by applying erosion, dilation, and (OpenClose + CloseOpen)/2 morphological operations, maintaining text components using (OpenClose + CloseOpen)/2 operation with a new Geo-correction method, and subtracting two result images for eliminating false-positive components further. In the third filtering step, the characteristics of each component such as the ratio of the number of pixels in each candidate component to the number of its boundary pixels and the ratio of the minor to the major axis of each bounding box are used. Acceptable results have been obtained using the proposed method on 300 news images with a recognition rate of 93.6%. Also, our method indicates a good performance on all the various kinds of images by adjusting the size of the structuring element.
引用
收藏
页码:521 / 530
页数:10
相关论文
共 50 条
  • [1] Text extraction in digital news video using morphology
    Byun, H
    Jang, I
    Choi, Y
    DOCUMENT ANALYSIS SYSTEM V, PROCEEDINGS, 2002, 2423 : 341 - 352
  • [2] Automatic Person Information Extraction Using Overlay Text in Television News Interview Videos
    Lee, Sanghee
    Jo, Kanghyun
    2017 IEEE 15TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2017, : 583 - 588
  • [3] Automatic Definition Extraction and Crosswords Generation From News Text
    Esteche, Jennifer
    Romero, Rornina
    Chiruzzo, Luis
    Rosa, Aiala
    PROCEEDINGS OF THE 2016 XLII LATIN AMERICAN COMPUTING CONFERENCE (CLEI), 2016,
  • [4] AUTOMATIC TEXT EXTRACTION, REMOVAL AND INPAINTING OF COMPLEX DOCUMENT IMAGES
    Chen, Yen-Lin
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (1A): : 303 - 327
  • [5] Techniques and challenges of automatic text extraction in complex images: A survey
    Sumathi, C.P.
    Santhanam, T.
    Priya, N.
    Journal of Theoretical and Applied Information Technology, 2012, 35 (02): : 225 - 235
  • [6] Automatic Stent Segmentation in IOCT images using Combined Feature Extraction Techniques and Mathematical Morphology
    Moraes, Matheus Cardoso
    Cardona Cardenas, Diego Armando
    Furuie, Sergio Shiguemi
    2013 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), 2013, 40 : 1215 - 1218
  • [7] Automatic Web News Extraction Using Blocking Tag
    Lin Ziyi
    Shen Beijun
    Tang Xinhuai
    Chen Delai
    2009 SECOND INTERNATIONAL CONFERENCE ON MACHINE VISION, PROCEEDINGS, ( ICMV 2009), 2009, : 74 - +
  • [8] Automatic Extraction of Text and Non-text Information Directly from Compressed Document Images
    Javed, Mohammed
    Nagabhushan, P.
    Chaudhuri, Bidyut B.
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS 2016), 2017, 552 : 38 - 46
  • [9] Automatic Text Summarization of News Articles
    Sethi, Prakhar
    Sonawane, Sameer
    Khanwalker, Saumitra
    Keskar, R. B.
    2017 INTERNATIONAL CONFERENCE ON BIG DATA, IOT AND DATA SCIENCE (BID), 2017, : 23 - 29
  • [10] Automatic text categorization of news articles
    Amasyali, MF
    Yildirim, T
    PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 224 - 226