Video OCR: indexing digital news libraries by recognition of superimposed captions

被引:84
|
作者
Sato, T
Kanade, T
Hughes, EK
Smith, MA
Satoh, S
机构
[1] Toshiba Co Ltd, Saiwai Ku, Kawasaki, Kanagawa 2108501, Japan
[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[3] Natl Ctr Sci Informat Syst NACSIS, Bunkyo Ku, Tokyo 1128640, Japan
关键词
digital video library; caption; index; OCR; image enhancement;
D O I
10.1007/s005300050140
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The automatic extraction and recognition of news captions and annotations can be of great help locating topics of interest in digital news video libraries. To achieve this goal, we present a technique, called Video OCR (Optical Character Reader), which detects, extracts, and reads text areas in digital video data. In this paper, we address problems, describe the method by which Video OCR operates, and suggest applications for its use in digital news archives. To solve two problems of character recognition for videos, low-resolution characters and extremely complex backgrounds, we apply an interpolation filter, multiframe integration and character extraction filters. Character segmentation is performed by a recognition-based segmentation method, and intermediate character recognition results are used to improve the segmentation. We also include a method for locating text areas using text-like properties and the use of a language-based postprocessing technique to increase word recognition rates, The overall recognition results are satisfactory for use in news indexing. Performing Video OCR on news video and combining its results with other video understanding techniques will improve the overall understanding of the news video content.
引用
收藏
页码:385 / 395
页数:11
相关论文
共 50 条
  • [11] Mathematical Symbol Indexing for Digital Libraries
    Marinai, Simone
    Miotti, Beatrice
    Soda, Giovanni
    DIGITAL LIBRARIES, 2010, 91 : 113 - 124
  • [12] Using Closed Captions as Supervision for Video Activity Recognition
    Gupta, Sonal
    Mooney, Raymond J.
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1083 - 1088
  • [13] A semi-automatic adaptive OCR for digital libraries
    Rawat, S
    Kumar, KSS
    Meshesha, M
    Sikdar, ID
    Balasubramanian, A
    Jawahar, CV
    DOCUMENT ANALYSIS SYSTEMS VII, PROCEEDINGS, 2006, 3872 : 13 - 24
  • [14] Digital libraries for electronic news
    Shepherd, MA
    Watters, CR
    Burkowski, FJ
    DIGITAL LIBRARIES: RESEARCH AND TECHNOLOGY ADVANCES, 1996, 1082 : 55 - 62
  • [15] Text extraction, enhancement and OCR in digital video
    Li, HP
    Doermann, D
    Kia, O
    DOCUMENT ANALYSIS SYSTEMS: THEORY AND PRACTICE, 1999, 1655 : 363 - 377
  • [16] Transcribing broadcast news for audio and video indexing
    Gauvain, JL
    Lamel, L
    Adda, G
    COMMUNICATIONS OF THE ACM, 2000, 43 (02) : 64 - 70
  • [17] The semantic pathfinder for generic news video indexing
    Snoek, C. G. M.
    Worring, M.
    Geusebroek, J. M.
    Koelma, D. C.
    Seinstra, F. J.
    Smeulders, A. W. M.
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1469 - +
  • [18] Multimodal approach for summarizing and indexing news video
    Kim, JG
    Chang, HS
    Kim, YT
    Kang, K
    Kim, M
    Kim, J
    Kim, HM
    ETRI JOURNAL, 2002, 24 (01) : 1 - 11
  • [19] A NOVEL AUDIOVISUAL ANALYSIS FOR NEWS VIDEO INDEXING
    Huang Yubin
    Dong Yuan
    Dong Chengyu
    Wang Haila
    PROCEEDINGS OF 2009 2ND IEEE INTERNATIONAL CONFERENCE ON BROADBAND NETWORK & MULTIMEDIA TECHNOLOGY, 2009, : 486 - 490
  • [20] Color object indexing and retrieval in digital libraries
    Wei, H
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2002, 11 (08) : 912 - 922