Video OCR: indexing digital news libraries by recognition of superimposed captions

被引：84

作者：

Sato, T

Kanade, T

Hughes, EK

Smith, MA

Satoh, S

机构：

[1] Toshiba Co Ltd, Saiwai Ku, Kawasaki, Kanagawa 2108501, Japan

[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA

[3] Natl Ctr Sci Informat Syst NACSIS, Bunkyo Ku, Tokyo 1128640, Japan

来源：

MULTIMEDIA SYSTEMS | 1999年 / 7卷 / 05期

关键词：

digital video library; caption; index; OCR; image enhancement;

D O I：

10.1007/s005300050140

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The automatic extraction and recognition of news captions and annotations can be of great help locating topics of interest in digital news video libraries. To achieve this goal, we present a technique, called Video OCR (Optical Character Reader), which detects, extracts, and reads text areas in digital video data. In this paper, we address problems, describe the method by which Video OCR operates, and suggest applications for its use in digital news archives. To solve two problems of character recognition for videos, low-resolution characters and extremely complex backgrounds, we apply an interpolation filter, multiframe integration and character extraction filters. Character segmentation is performed by a recognition-based segmentation method, and intermediate character recognition results are used to improve the segmentation. We also include a method for locating text areas using text-like properties and the use of a language-based postprocessing technique to increase word recognition rates, The overall recognition results are satisfactory for use in news indexing. Performing Video OCR on news video and combining its results with other video understanding techniques will improve the overall understanding of the news video content.

引用

页码：385 / 395

页数：11

共 50 条

[11] Mathematical Symbol Indexing for Digital Libraries
Marinai, Simone
Miotti, Beatrice
Soda, Giovanni
DIGITAL LIBRARIES, 2010, 91 : 113 - 124
[12] Using Closed Captions as Supervision for Video Activity Recognition
Gupta, Sonal
Mooney, Raymond J.
PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1083 - 1088
[13] A semi-automatic adaptive OCR for digital libraries
Rawat, S
Kumar, KSS
Meshesha, M
Sikdar, ID
Balasubramanian, A
Jawahar, CV
DOCUMENT ANALYSIS SYSTEMS VII, PROCEEDINGS, 2006, 3872 : 13 - 24
[14] Digital libraries for electronic news
Shepherd, MA
Watters, CR
Burkowski, FJ
DIGITAL LIBRARIES: RESEARCH AND TECHNOLOGY ADVANCES, 1996, 1082 : 55 - 62
[15] Text extraction, enhancement and OCR in digital video
Li, HP
Doermann, D
Kia, O
DOCUMENT ANALYSIS SYSTEMS: THEORY AND PRACTICE, 1999, 1655 : 363 - 377
[16] Transcribing broadcast news for audio and video indexing
Gauvain, JL
Lamel, L
Adda, G
COMMUNICATIONS OF THE ACM, 2000, 43 (02) : 64 - 70
[17] The semantic pathfinder for generic news video indexing
Snoek, C. G. M.
Worring, M.
Geusebroek, J. M.
Koelma, D. C.
Seinstra, F. J.
Smeulders, A. W. M.
2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1469 - +
[18] Multimodal approach for summarizing and indexing news video
Kim, JG
Chang, HS
Kim, YT
Kang, K
Kim, M
Kim, J
Kim, HM
ETRI JOURNAL, 2002, 24 (01) : 1 - 11
[19] A NOVEL AUDIOVISUAL ANALYSIS FOR NEWS VIDEO INDEXING
Huang Yubin
Dong Yuan
Dong Chengyu
Wang Haila
PROCEEDINGS OF 2009 2ND IEEE INTERNATIONAL CONFERENCE ON BROADBAND NETWORK & MULTIMEDIA TECHNOLOGY, 2009, : 486 - 490
[20] Color object indexing and retrieval in digital libraries
Wei, H
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2002, 11 (08) : 912 - 922

← 1 2 3 4 5 →