A Caption Text Detection Method from Images/Videos for Efficient Indexing and Retrieval of Multimedia Data

被引:4
|
作者
Tehsin, Samabia [1 ]
Masood, Asif [1 ]
Kausar, Sumaira [2 ]
Javed, Yunous [2 ]
机构
[1] NUST, MCS, Islamabad, Pakistan
[2] NUST, Coll E&ME, Islamabad, Pakistan
关键词
Text extraction; image retrieval; caption text; document analysis; ICDAR; 2013; VIDEO; IMAGES; EXTRACTION; SEGMENTATION; LOCALIZATION; RECOGNITION; CHARACTERS; ALGORITHM;
D O I
10.1142/S0218001415550034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Textual information embedded in multimedia can provide a vital tool for indexing and retrieval. Text extraction process has many inherent problems due to the variation in font sizes, color, backgrounds and resolution. Text detection and localization are the most challenging phases of text extraction process whereas text extraction results are highly dependent upon these phases. This paper focuses on the text localization because of its very fundamental importance. Two effective feature vectors are introduced for the classification of the text and nontext objects. First feature vector is represented by the Radon transform of text candidate objects. Second feature vector is derived from the detailed geometrical analysis of text contents. Union of two feature vectors is used for the classification of text and nontext objects using support vector machine (SVM). Text detection and localization results are evaluated on two publicly available datasets namely ICDAR 2013 and IPC-Artificial text. Moreover, results are compared with state-of-the-art techniques and the Comparison demonstrates the superiority of the presented research.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Semantic Indexing for Efficient Retrieval of Multimedia Data
    Cao, Xiaoqi
    Klusch, Matthias
    ADAPTIVE MULTIMEDIA RETRIEVAL: SEMANTICS, CONTEXT, AND ADAPTATION, AMR 2012, 2014, 8382 : 165 - 180
  • [2] A graphical model for multimedia data indexing and retrieval
    Dagtas, S
    CISST '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON IMAGING SCIENCE, SYSTEMS, AND TECHNOLOGY, 2004, : 469 - 475
  • [3] Text From Corners: A Novel Approach to Detect Text and Caption in Videos
    Zhao, Xu
    Lin, Kai-Hsiang
    Fu, Yun
    Hu, Yuxiao
    Liu, Yuncai
    Huang, Thomas S.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (03) : 790 - 799
  • [4] Efficient indexing for Query By String text retrieval
    Ghosh, Suman K.
    Gomez, Liuis
    Karatzas, Dimosthenis
    Valveny, Ernest
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1236 - 1240
  • [5] A system for detection of moving caption text in videos: a news use case
    Elshahaby, Hossam
    Rashwan, Mohsen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (17) : 25607 - 25631
  • [6] A system for detection of moving caption text in videos: a news use case
    Hossam Elshahaby
    Mohsen Rashwan
    Multimedia Tools and Applications, 2021, 80 : 25607 - 25631
  • [7] Efficient Visual Search of Videos Cast as Text Retrieval
    Sivic, Josef
    Zisserman, Andrew
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (04) : 591 - 606
  • [8] An efficient method of indexing for image retrieval from pdf files
    Mata, Jacinto
    Crespo, Mariano
    Mana, Manuel J.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (45): : 21 - 29
  • [9] A hierarchical bitmap indexing method for content based multimedia retrieval
    Park, J
    Nang, J
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON INTERNET AND MULTIMEDIA SYSTEMS AND APPLICATIONS, 2006, : 223 - +
  • [10] An Efficient Indexing Structure for Content Based Multimedia Retrieval with Relevance Feedback
    Nang, Jongho
    Park, Joohyoun
    APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 517 - 524