Video OCR: indexing digital news libraries by recognition of superimposed captions

被引:84
|
作者
Sato, T
Kanade, T
Hughes, EK
Smith, MA
Satoh, S
机构
[1] Toshiba Co Ltd, Saiwai Ku, Kawasaki, Kanagawa 2108501, Japan
[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[3] Natl Ctr Sci Informat Syst NACSIS, Bunkyo Ku, Tokyo 1128640, Japan
关键词
digital video library; caption; index; OCR; image enhancement;
D O I
10.1007/s005300050140
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The automatic extraction and recognition of news captions and annotations can be of great help locating topics of interest in digital news video libraries. To achieve this goal, we present a technique, called Video OCR (Optical Character Reader), which detects, extracts, and reads text areas in digital video data. In this paper, we address problems, describe the method by which Video OCR operates, and suggest applications for its use in digital news archives. To solve two problems of character recognition for videos, low-resolution characters and extremely complex backgrounds, we apply an interpolation filter, multiframe integration and character extraction filters. Character segmentation is performed by a recognition-based segmentation method, and intermediate character recognition results are used to improve the segmentation. We also include a method for locating text areas using text-like properties and the use of a language-based postprocessing technique to increase word recognition rates, The overall recognition results are satisfactory for use in news indexing. Performing Video OCR on news video and combining its results with other video understanding techniques will improve the overall understanding of the news video content.
引用
收藏
页码:385 / 395
页数:11
相关论文
共 50 条
  • [1] Video OCR: indexing digital news libraries by recognition of superimposed captions
    Toshio Sato
    Takeo Kanade
    Ellen K. Hughes
    Michael A. Smith
    Shin'ichi Satoh
    Multimedia Systems, 1999, 7 : 385 - 395
  • [2] Video OCR for digital news archive
    Sato, T
    Kanade, T
    Hughes, EK
    Smith, MA
    1998 IEEE INTERNATIONAL WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO DATABASE, PROCEEDINGS, 1998, : 52 - 60
  • [3] Recognition of Concordances for Indexing in Digital Libraries
    Marinai, Simone
    Capobianco, Samuele
    Ziran, Zahra
    Giuntini, Andrea
    Mansueto, Pierluigi
    DIGITAL LIBRARIES: THE ERA OF BIG DATA AND DATA SCIENCE, IRCDL 2020, 2020, 1177 : 135 - 147
  • [4] Recognition and Transition Frame Detection of Arabic News Captions for Video Retrieval
    Iwata, Seiya
    Ohyama, Wataru
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 4005 - 4010
  • [5] Detection and retrieval of captions in news video
    Luo, M
    Bai, XS
    Xu, GG
    VISUALIZATION AND OPTIMIZATION TECHNIQUES, 2001, 4553 : 233 - 238
  • [6] Superimposed Information Architecture for Digital Libraries
    Archer, David W.
    Delcambre, Lois M. L.
    Corubolo, Fabio
    Cassel, Lillian
    Price, Susan
    Murthy, Uma
    Maier, David
    Fox, Edward A.
    Murthy, Sudarshan
    McCall, John
    Kuchibhotla, Kiran
    Suryavanshi, Rahul
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2008, 5173 : 88 - +
  • [7] Automated video indexing of very large video libraries
    Wactlar, HD
    Hauptmann, AG
    Smith, MA
    Pendyala, KV
    Garlington, D
    SMPTE JOURNAL, 1997, 106 (08): : 524 - 529
  • [8] Recognition and Connection of Moving Captions in Arabic TV News
    Iwata, Seiya
    Ohyama, Wataru
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 163 - 167
  • [9] Multimodal Indexing of Multilingual News Video
    Ghosh, Hiranmay
    Kopparapu, Sunil Kumar
    Chattopadhyay, Tanushyam
    Khare, Ashish
    Wattamwar, Sujal Subhash
    Gorai, Amarendra
    Pandharipande, Meghna
    INTERNATIONAL JOURNAL OF DIGITAL MULTIMEDIA BROADCASTING, 2010, 2010
  • [10] Digital libraries and autonomous citation indexing
    Lawrence, S
    Giles, CL
    Bollacker, K
    COMPUTER, 1999, 32 (06) : 67 - +