Video OCR: indexing digital news libraries by recognition of superimposed captions

被引:84
|
作者
Sato, T
Kanade, T
Hughes, EK
Smith, MA
Satoh, S
机构
[1] Toshiba Co Ltd, Saiwai Ku, Kawasaki, Kanagawa 2108501, Japan
[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[3] Natl Ctr Sci Informat Syst NACSIS, Bunkyo Ku, Tokyo 1128640, Japan
关键词
digital video library; caption; index; OCR; image enhancement;
D O I
10.1007/s005300050140
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The automatic extraction and recognition of news captions and annotations can be of great help locating topics of interest in digital news video libraries. To achieve this goal, we present a technique, called Video OCR (Optical Character Reader), which detects, extracts, and reads text areas in digital video data. In this paper, we address problems, describe the method by which Video OCR operates, and suggest applications for its use in digital news archives. To solve two problems of character recognition for videos, low-resolution characters and extremely complex backgrounds, we apply an interpolation filter, multiframe integration and character extraction filters. Character segmentation is performed by a recognition-based segmentation method, and intermediate character recognition results are used to improve the segmentation. We also include a method for locating text areas using text-like properties and the use of a language-based postprocessing technique to increase word recognition rates, The overall recognition results are satisfactory for use in news indexing. Performing Video OCR on news video and combining its results with other video understanding techniques will improve the overall understanding of the news video content.
引用
收藏
页码:385 / 395
页数:11
相关论文
共 50 条
  • [21] Visual digests for news video libraries
    Christel, MG
    ACM MULTIMEDIA 99, PROCEEDINGS, 1999, : 303 - 311
  • [22] Music indexing and retrieval for digital music libraries
    Tseng, YH
    PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : A533 - A536
  • [23] Indexing, browsing, and searching of digital video
    Smeaton, AF
    ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 2004, 38 : 371 - 407
  • [24] Distribution alternatives for superimposed information services in digital libraries
    Murthy, S
    Maier, D
    Delcambre, L
    PEER-TO-PEER, GRID, AND SERVICE -ORIENTATION IN DIGITAL LIBRARY ARCHITECTURES, 2005, 3664 : 96 - 111
  • [25] Digital video libraries and the Internet
    Smith, JR
    IEEE COMMUNICATIONS MAGAZINE, 1999, 37 (01) : 92 - 97
  • [26] Research on Video Text Recognition Technology Based on OCR
    Ding Jie
    Zhao Guotao
    Xu Fang
    2018 10TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA), 2018, : 457 - 462
  • [27] Unsupervised story segmentation and indexing of broadcast news video
    Pranabjyoti Haloi
    M.K. Bhuyan
    Dibyajyoti Chatterjee
    Pooja Rani Borah
    Multimedia Tools and Applications, 2023, 82 : 8645 - 8664
  • [28] Unsupervised story segmentation and indexing of broadcast news video
    Haloi, Pranabjyoti
    Bhuyan, M. K.
    Chatterjee, Dibyajyoti
    Borah, Pooja Rani
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (06) : 8645 - 8664
  • [29] NVIBRS - News video indexing, browsing and retrieval system
    Suresh, V
    Palanivel, S
    Yegnanarayana, B
    2005 INTERNATIONAL CONFERENCE ON INTELLIGENT SENSING AND INFORMATION PROCESSING, PROCEEDINGS, 2005, : 181 - 186
  • [30] News story segmentation in the Fischlar video indexing system
    O'Connor, N
    Czirjek, C
    Deasy, S
    Marlow, S
    Murphy, N
    Smeaton, A
    2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2001, : 418 - 421