Video OCR: indexing digital news libraries by recognition of superimposed captions

Cited by: 84

Authors:

Sato, T

Kanade, T

Hughes, EK

Smith, MA

Satoh, S

Affiliations:

[1] Toshiba Co Ltd, Saiwai Ku, Kawasaki, Kanagawa 2108501, Japan

[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA

[3] Natl Ctr Sci Informat Syst NACSIS, Bunkyo Ku, Tokyo 1128640, Japan

Source:

MULTIMEDIA SYSTEMS | 1999 / Vol. 7 / No. 05

Keywords:

digital video library; caption; index; OCR; image enhancement;

DOI:

10.1007/s005300050140

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

The automatic extraction and recognition of news captions and annotations can be of great help in locating topics of interest in digital news video libraries. To achieve this goal, we present a technique, called Video OCR (Optical Character Reader), which detects, extracts, and reads text areas in digital video data. In this paper, we address problems, describe the method by which Video OCR operates, and suggest applications for its use in digital news archives. To solve two problems of character recognition for videos, low-resolution characters and extremely complex backgrounds, we apply an interpolation filter, multiframe integration, and character extraction filters. Character segmentation is performed by a recognition-based segmentation method, and intermediate character recognition results are used to improve the segmentation. We also include a method for locating text areas using text-like properties and the use of a language-based postprocessing technique to increase word recognition rates. The overall recognition results are satisfactory for use in news indexing. Performing Video OCR on news video and combining its results with other video understanding techniques will improve the overall understanding of the news video content.
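The abstract names multiframe integration without giving details. As a rough illustration only, the sketch below assumes that a caption is bright and stays fixed across consecutive frames while the background changes, so a per-pixel minimum over those frames keeps the caption and darkens the background. The function name `integrate_min`, the list-of-lists frame format, and the choice of a minimum are assumptions made for this example and are not taken from this record.

```python
from typing import List

# Hypothetical frame format: a grayscale image as rows of 0-255 intensities.
Frame = List[List[int]]


def integrate_min(frames: List[Frame]) -> Frame:
    """Return the per-pixel minimum over frames showing the same caption.

    Assumption: caption pixels stay bright in every frame, while background
    pixels are dark in at least one frame, so the minimum suppresses them.
    """
    if not frames:
        raise ValueError("need at least one frame")
    height, width = len(frames[0]), len(frames[0][0])
    for f in frames:
        if len(f) != height or any(len(row) != width for row in f):
            raise ValueError("all frames must have the same size")
    return [
        [min(f[y][x] for f in frames) for x in range(width)]
        for y in range(height)
    ]


# Toy 2x2 frames: the diagonal pixels play the role of a static caption.
frames = [
    [[255, 200], [30, 255]],
    [[255, 40], [180, 255]],
    [[255, 90], [60, 255]],
]
result = integrate_min(frames)
```

In this toy input the two diagonal pixels stay at 255 in every frame and survive the integration, while the off-diagonal background pixels drop to their darkest observed values.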

Pages: 385-395

Page count: 11

50 records in total

[31] Content-based news video retrieval with closed captions and time alignment
Kim, YT
Kim, JG
Chang, HS
Kang, K
Kim, J
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 879 - 884
[32] The New Generation of Citation Indexing in the Age of Digital Libraries
Liu, Mengxiong
Cabrera, Peggy
POLICY FUTURES IN EDUCATION, 2008, 6 (01) : 77 - 86
[33] Compression and full-text indexing for digital libraries
Witten, IH
Moffat, A
Bell, TC
DIGITAL LIBRARIES: CURRENT ISSUES, 1995, 916 : 181 - 201
[34] A video indexing system using character recognition
Kim, EY
Kim, KI
Jung, K
Kim, HJ
IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - 2000 DIGEST OF TECHNICAL PAPERS, 2000, : 358 - 359
[35] Automatic face recognition for video indexing applications
Torres, L
Vilà, J
PATTERN RECOGNITION, 2002, 35 (03) : 615 - 625
[36] A survey of technologies for parsing and indexing digital video
Ahanger, G
Little, TDC
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1996, 7 (01) : 28 - 43
[37] Indexing text events in digital video databases
Gargi, U
Antani, S
Kasturi, R
FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 916 - 918
[38] Video tape production system with digital captions in a code format for the deaf
Nishikawa, Satoshi, 1600, (20):
[39] Video Indexing System Based on Multimodal Information Extraction Using Combination of ASR and OCR
Varma, Sandeep
Pandey, Arunanshu
Shivam
Das, Soham
Roy, Soumya Deep
BIG-DATA-ANALYTICS IN ASTRONOMY, SCIENCE, AND ENGINEERING, BDA 2021, 2022, 13167 : 201 - 208
[40] Issues for image/video digital libraries
Manjunath, BS
Deng, YN
ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : B595 - B598