Text From Corners: A Novel Approach to Detect Text and Caption in Videos

Cited by: 96

Authors:

Zhao, Xu [1,4]

Lin, Kai-Hsiang [1]

Fu, Yun [2]

Hu, Yuxiao [3]

Liu, Yuncai [4]

Huang, Thomas S. [1]

Affiliations:

[1] Univ Illinois, Beckman Inst Adv Sci & Technol, Urbana, IL 61801 USA

[2] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA

[3] Microsoft Live Search, Redmond, WA 98052 USA

[4] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China

Source:

IEEE Transactions on Image Processing | 2011 / Vol. 20 / No. 3

Funding:

U.S. National Science Foundation

Keywords:

Caption detection; Harris corner detector; moving caption; optical flow; text detection; video retrieval; IMAGE RETRIEVAL; LOCALIZATION; EXTRACTION;

DOI:

10.1109/TIP.2010.2068553

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Detecting text and captions in videos is important and in great demand for video retrieval, annotation, indexing, and content analysis. In this paper, we present a corner-based approach to detecting text and captions in videos. The approach is inspired by the observation that corner points occur densely and in an orderly pattern in characters, especially in text and captions. We use several discriminative features to describe the text regions formed by the corner points. These features can be used flexibly and thus adapted to different applications. Language independence is an important advantage of the proposed method. Moreover, based on the text features, we further develop a novel algorithm to detect moving captions in videos. In this algorithm, motion features extracted by optical flow are combined with text features to detect moving caption patterns. A decision tree is adopted to learn the classification criteria. Experiments conducted on a large volume of real video shots demonstrate the efficiency and robustness of the proposed approaches and the real-world system. Our text and caption detection system was recently highlighted in a worldwide multimedia retrieval competition, Star Challenge, where it achieved superior performance and the top ranking.
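
The corner-density observation in the abstract can be illustrated with a small, dependency-free sketch. This is not the authors' implementation: the record gives no parameters or feature definitions, so the function names, the 3x3 window, the Harris constant `k = 0.04`, and the relative threshold are all assumptions chosen for illustration. It computes a basic Harris corner response, keeps the strong responses as corner points, and measures how densely they fill a candidate box.

```python
def harris_response(img, k=0.04):
    """Harris response R = det(M) - k * trace(M)^2 per pixel (k is an assumed default)."""
    h, w = len(img), len(img[0])
    # Central-difference gradients; border pixels are left at zero.
    ix = [[0.0] * w for _ in range(h)]
    iy = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            ix[y][x] = (img[y][x + 1] - img[y][x - 1]) / 2.0
            iy[y][x] = (img[y + 1][x] - img[y - 1][x]) / 2.0
    resp = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Structure tensor M summed over an (assumed) unweighted 3x3 window.
            sxx = sxy = syy = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    gx, gy = ix[y + dy][x + dx], iy[y + dy][x + dx]
                    sxx += gx * gx
                    sxy += gx * gy
                    syy += gy * gy
            det = sxx * syy - sxy * sxy
            trace = sxx + syy
            resp[y][x] = det - k * trace * trace
    return resp


def corner_points(img, rel_thresh=0.1):
    """Return (x, y) pixels whose response exceeds rel_thresh * peak response."""
    resp = harris_response(img)
    peak = max(max(row) for row in resp)
    if peak <= 0:
        return []  # flat image: no corners
    return [(x, y)
            for y, row in enumerate(resp)
            for x, r in enumerate(row)
            if r > rel_thresh * peak]


def corner_density(points, box):
    """Fraction of pixels in box = (x0, y0, x1, y1), half-open, that are corner points."""
    x0, y0, x1, y1 = box
    inside = sum(1 for x, y in points if x0 <= x < x1 and y0 <= y < y1)
    return inside / float((x1 - x0) * (y1 - y0))


def make_demo_frame():
    """Hypothetical 20x40 frame: flat background plus a checkered, text-like block."""
    img = [[0.0] * 40 for _ in range(20)]
    for y in range(4, 12):
        for x in range(4, 16):
            img[y][x] = float(((x // 2) + (y // 2)) % 2)
    return img


if __name__ == "__main__":
    pts = corner_points(make_demo_frame())
    print("density in textured block:", corner_density(pts, (4, 4, 16, 12)))
    print("density in flat region:", corner_density(pts, (24, 0, 40, 20)))
```

In a fuller pipeline, a density like this would be one of several region features; the motion features and decision tree mentioned in the abstract are outside the scope of this sketch.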


Pages: 790-799

Page count: 10
