Text From Corners: A Novel Approach to Detect Text and Caption in Videos

被引:96
|
作者
Zhao, Xu [1 ,4 ]
Lin, Kai-Hsiang [1 ]
Fu, Yun [2 ]
Hu, Yuxiao [3 ]
Liu, Yuncai [4 ]
Huang, Thomas S. [1 ]
机构
[1] Univ Illinois, Beckman Inst Adv Sci & Technol, Urbana, IL 61801 USA
[2] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
[3] Microsoft Live Search, Redmond, WA 98052 USA
[4] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
基金
美国国家科学基金会;
关键词
Caption detection; Harris corner detector; moving caption; optical flow; text detection; video retrieval; IMAGE RETRIEVAL; LOCALIZATION; EXTRACTION;
D O I
10.1109/TIP.2010.2068553
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting text and caption from videos is important and in great demand for video retrieval, annotation, indexing, and content analysis. In this paper, we present a corner based approach to detect text and caption from videos. This approach is inspired by the observation that there exist dense and orderly presences of corner points in characters, especially in text and caption. We use several discriminative features to describe the text regions formed by the corner points. The usage of these features is in a flexible manner, thus, can be adapted to different applications. Language independence is an important advantage of the proposed method. Moreover, based upon the text features, we further develop a novel algorithm to detect moving captions in videos. In the algorithm, the motion features, extracted by optical flow, are combined with text features to detect the moving caption patterns. The decision tree is adopted to learn the classification criteria. Experiments conducted on a large volume of real video shots demonstrate the efficiency and robustness of our proposed approaches and the real-world system. Our text and caption detection system was recently highlighted in a worldwide multimedia retrieval competition, Star Challenge, by achieving the superior performance with the top ranking.
引用
收藏
页码:790 / 799
页数:10
相关论文
共 50 条
  • [41] Adaptive Text Denoising Network for Image Caption Editing
    Yuan, Mengqi
    Bao, Bing-Kun
    Tan, Zhiyi
    Xu, Changsheng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [42] Urdu Caption Text Detection using Textural Features
    Mirza, Ali
    Fayyaz, Marium
    Seher, Zunera
    Siddiqi, Imran
    PROCEEDINGS OF THE 2ND MEDITERRANEAN CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (MEDPRAI-2018), 2018, : 70 - 75
  • [43] Caption Text and Keyframe Based Video Retrieval System
    Dung Mai
    Kiem Hoang
    COMPUTATIONAL COLLECTIVE INTELLIGENCE - TECHNOLOGIES AND APPLICATIONS, PT II, 2012, 7654 : 244 - 252
  • [44] Image difference caption generation with text information assistance
    Chen W.
    Wang W.
    Jin Q.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2022, 48 (08): : 1436 - 1444
  • [45] Caption text recognition in video frames by MAP matching
    Nakamura, A
    Yamamoto, K
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 650 - 654
  • [46] A novel hybrid approach for text encoding: Cognitive Attention To Syntax model to detect online misinformation
    Faye, Geraud
    Ouerdane, Wassila
    Gadek, Guillaume
    Gahbiche, Souhir
    Gatepaille, Sylvain
    DATA & KNOWLEDGE ENGINEERING, 2023, 148
  • [47] Tagging Webcast Text in Baseball Videos by Video Segmentation and Text Alignment
    Chiu, Chih-Yi
    Lin, Po-Chih
    Li, Sheng-Yang
    Tsai, Tsung-Han
    Tsai, Yu-Lung
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (07) : 999 - 1013
  • [48] Efficient Approach to Detect and Localize Text in Natural Scene Images
    Samuel, S. R. Surem
    Christopher, C. Seldev
    ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY ALGORITHMS IN ENGINEERING SYSTEMS, VOL 2, 2015, 325 : 665 - 674
  • [49] Using personal writings to detect dementia: A text mining approach
    Asllani, Beni
    Mullen, Deborah M.
    HEALTH INFORMATICS JOURNAL, 2023, 29 (04)
  • [50] A Novel Approach for Text Steganography: Generating Text Summary using Reflection Symmetry
    Majumder, Anandaprova
    Changder, Suvamoy
    FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE: MODELING TECHNIQUES AND APPLICATIONS (CIMTA) 2013, 2013, 10 : 112 - 120