Text From Corners: A Novel Approach to Detect Text and Caption in Videos

被引：96

作者：

Zhao, Xu ^{[1
,4
]}

Lin, Kai-Hsiang ^{[1
]}

Fu, Yun ^{[2
]}

Hu, Yuxiao ^{[3
]}

Liu, Yuncai ^{[4
]}

Huang, Thomas S. ^{[1
]}

机构：

[1] Univ Illinois, Beckman Inst Adv Sci & Technol, Urbana, IL 61801 USA

[2] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA

[3] Microsoft Live Search, Redmond, WA 98052 USA

[4] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2011年 / 20卷 / 03期

基金：

美国国家科学基金会;

关键词：

Caption detection; Harris corner detector; moving caption; optical flow; text detection; video retrieval; IMAGE RETRIEVAL; LOCALIZATION; EXTRACTION;

D O I：

10.1109/TIP.2010.2068553

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Detecting text and caption from videos is important and in great demand for video retrieval, annotation, indexing, and content analysis. In this paper, we present a corner based approach to detect text and caption from videos. This approach is inspired by the observation that there exist dense and orderly presences of corner points in characters, especially in text and caption. We use several discriminative features to describe the text regions formed by the corner points. The usage of these features is in a flexible manner, thus, can be adapted to different applications. Language independence is an important advantage of the proposed method. Moreover, based upon the text features, we further develop a novel algorithm to detect moving captions in videos. In the algorithm, the motion features, extracted by optical flow, are combined with text features to detect the moving caption patterns. The decision tree is adopted to learn the classification criteria. Experiments conducted on a large volume of real video shots demonstrate the efficiency and robustness of our proposed approaches and the real-world system. Our text and caption detection system was recently highlighted in a worldwide multimedia retrieval competition, Star Challenge, by achieving the superior performance with the top ranking.

引用

页码：790 / 799

页数：10

共 50 条

[1] Caption Text Location with Combined Features for News Videos
Su, Yuting
Ji, Zhong
Song, Xingguang
Hua, Rui
2008 INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND TRAINING AND 2008 INTERNATIONAL WORKSHOP ON GEOSCIENCE AND REMOTE SENSING, VOL 1, PROCEEDINGS, 2009, : 714 - 718
[2] An Improved Technique to Detect Text from Scene Videos
Pote, Saloni A.
Mehta, Mayuri A.
2017 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2017, : 1190 - 1194
[3] Robustly detect different types of text in videos
Cai, Yuanqiang
Wang, Weiqiang
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (16): : 12827 - 12840
[4] Robustly detect different types of text in videos
Yuanqiang Cai
Weiqiang Wang
Neural Computing and Applications, 2020, 32 : 12827 - 12840
[5] Key Frame Extraction, Localization and Segmentation of Caption Text in News Videos
Phadke, Harsha H.
Mallika, H.
2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 543 - 547
[6] A system for detection of moving caption text in videos: a news use case
Elshahaby, Hossam
Rashwan, Mohsen
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (17) : 25607 - 25631
[7] A system for detection of moving caption text in videos: a news use case
Hossam Elshahaby
Mohsen Rashwan
Multimedia Tools and Applications, 2021, 80 : 25607 - 25631
[8] A novel approach to text detection and extraction from videos by discriminative features and density
1600, Chinese Institute of Electronics (23):
[9] A Novel Approach to Text Detection and Extraction from Videos by Discriminative Features and Density
Wei Baogang
Zhang Yin
Yuan Jie
Liu Yonghuai
Wang Lidong
CHINESE JOURNAL OF ELECTRONICS, 2014, 23 (02) : 322 - 328
[10] A Novel Multi-Oriented Chinese Text Extraction Approach from Videos
Liu, Yang
Song, Yonghong
Zhang, Yuanlin
Meng, Quan
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1355 - 1359

← 1 2 3 4 5 →