A Caption Text Detection Method from Images/Videos for Efficient Indexing and Retrieval of Multimedia Data

被引:4
|
作者
Tehsin, Samabia [1 ]
Masood, Asif [1 ]
Kausar, Sumaira [2 ]
Javed, Yunous [2 ]
机构
[1] NUST, MCS, Islamabad, Pakistan
[2] NUST, Coll E&ME, Islamabad, Pakistan
关键词
Text extraction; image retrieval; caption text; document analysis; ICDAR; 2013; VIDEO; IMAGES; EXTRACTION; SEGMENTATION; LOCALIZATION; RECOGNITION; CHARACTERS; ALGORITHM;
D O I
10.1142/S0218001415550034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Textual information embedded in multimedia can provide a vital tool for indexing and retrieval. Text extraction process has many inherent problems due to the variation in font sizes, color, backgrounds and resolution. Text detection and localization are the most challenging phases of text extraction process whereas text extraction results are highly dependent upon these phases. This paper focuses on the text localization because of its very fundamental importance. Two effective feature vectors are introduced for the classification of the text and nontext objects. First feature vector is represented by the Radon transform of text candidate objects. Second feature vector is derived from the detailed geometrical analysis of text contents. Union of two feature vectors is used for the classification of text and nontext objects using support vector machine (SVM). Text detection and localization results are evaluated on two publicly available datasets namely ICDAR 2013 and IPC-Artificial text. Moreover, results are compared with state-of-the-art techniques and the Comparison demonstrates the superiority of the presented research.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] A retrieval algorithm for specific face images in airport surveillance multimedia videos on cloud computing platform
    Zhang, Ning
    Jeong, Hwa-Young
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (16) : 17129 - 17143
  • [42] A retrieval algorithm for specific face images in airport surveillance multimedia videos on cloud computing platform
    Ning Zhang
    Hwa-Young Jeong
    Multimedia Tools and Applications, 2017, 76 : 17129 - 17143
  • [43] New method for text detection and segmentation from complex images
    Liu, Fang
    Peng, Xiang
    Wang, Tianjiang
    MIPPR 2007: AUTOMATIC TARGET RECOGNITION AND IMAGE ANALYSIS; AND MULTISPECTRAL IMAGE ACQUISITION, PTS 1 AND 2, 2007, 6786
  • [44] A Fast and Accurate Text Detection Method From Complex Images
    Ren, Zhiyun
    Huang, Linlin
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 1144 - +
  • [45] How multimedia shape crowdfunding outcomes: The overshadowing effect of images and videos on text in campaign information
    Yang, Jialiang
    Li, Yaokuang
    Calic, Goran
    Shevchenko, Anton
    JOURNAL OF BUSINESS RESEARCH, 2020, 117 : 6 - 18
  • [46] Center block duplication detection and indexing for efficient web information retrieval
    Cadenhead, Tyrone
    Chen, Jinlin
    Cook, Terry
    2006 1ST INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT, 2006, : 424 - +
  • [47] Hierarchical cellular tree: An efficient indexing scheme for content-based retrieval on multimedia databases
    Kiranyaz, Serkan
    Gabbouj, Moncef
    IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 9 (01) : 102 - 119
  • [48] Techniques and data structures for efficient multimedia retrieval based on similarity
    Lu, GJ
    IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (03) : 372 - 384
  • [49] Efficient indexing and retrieval of patient information from the big data using MapReduce framework and optimisation
    Merlin, N. R. Gladiss
    Prem, M. Vigilson
    JOURNAL OF INFORMATION SCIENCE, 2023, 49 (02) : 500 - 518
  • [50] Automated semantic indexing of imaging reports to support retrieval of medical images in the multimedia electronic medical record
    Lowe, HJ
    Antipov, I
    Hersh, W
    Smith, CA
    Mailhot, M
    METHODS OF INFORMATION IN MEDICINE, 1999, 38 (4-5) : 303 - 307