Video OCR: indexing digital news libraries by recognition of superimposed captions

被引:84
|
作者
Sato, T
Kanade, T
Hughes, EK
Smith, MA
Satoh, S
机构
[1] Toshiba Co Ltd, Saiwai Ku, Kawasaki, Kanagawa 2108501, Japan
[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[3] Natl Ctr Sci Informat Syst NACSIS, Bunkyo Ku, Tokyo 1128640, Japan
关键词
digital video library; caption; index; OCR; image enhancement;
D O I
10.1007/s005300050140
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The automatic extraction and recognition of news captions and annotations can be of great help locating topics of interest in digital news video libraries. To achieve this goal, we present a technique, called Video OCR (Optical Character Reader), which detects, extracts, and reads text areas in digital video data. In this paper, we address problems, describe the method by which Video OCR operates, and suggest applications for its use in digital news archives. To solve two problems of character recognition for videos, low-resolution characters and extremely complex backgrounds, we apply an interpolation filter, multiframe integration and character extraction filters. Character segmentation is performed by a recognition-based segmentation method, and intermediate character recognition results are used to improve the segmentation. We also include a method for locating text areas using text-like properties and the use of a language-based postprocessing technique to increase word recognition rates, The overall recognition results are satisfactory for use in news indexing. Performing Video OCR on news video and combining its results with other video understanding techniques will improve the overall understanding of the news video content.
引用
收藏
页码:385 / 395
页数:11
相关论文
共 50 条
  • [31] Content-based news video retrieval with closed captions and time alignment
    Kim, YT
    Kim, JG
    Chang, HS
    Kang, K
    Kim, J
    ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 879 - 884
  • [32] The New Generation of Citation Indexing in the Age of Digital Libraries
    Liu, Mengxiong
    Cabrera, Peggy
    POLICY FUTURES IN EDUCATION, 2008, 6 (01): : 77 - 86
  • [33] Compression and full-text indexing for digital libraries
    Witten, IH
    Moffat, A
    Bell, TC
    DIGITAL LIBRARIES: CURRENT ISSUES, 1995, 916 : 181 - 201
  • [34] A video indexing system using character recognition
    Kim, EY
    Kim, KI
    Jung, K
    Kim, HJ
    IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - 2000 DIGEST OF TECHNICAL PAPERS, 2000, : 358 - 359
  • [35] Automatic face recognition for video indexing applications
    Torres, L
    Vilà, J
    PATTERN RECOGNITION, 2002, 35 (03) : 615 - 625
  • [36] A survey of technologies for parsing and indexing digital video
    Ahanger, G
    Little, TDC
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1996, 7 (01) : 28 - 43
  • [37] Indexing text events in digital video databases
    Gargi, U
    Antani, S
    Kasturi, R
    FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 916 - 918
  • [39] Video Indexing System Based on Multimodal Information Extraction Using Combination of ASR and OCR
    Varma, Sandeep
    Pandey, Arunanshu
    Shivam
    Das, Soham
    Roy, Soumya Deep
    BIG-DATA-ANALYTICS IN ASTRONOMY, SCIENCE, AND ENGINEERING, BDA 2021, 2022, 13167 : 201 - 208
  • [40] Issues for image/video digital libraries
    Manjunath, BS
    Deng, YN
    ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : B595 - B598