A Caption Text Detection Method from Images/Videos for Efficient Indexing and Retrieval of Multimedia Data

Cited by: 4

Authors:

Tehsin, Samabia [1]

Masood, Asif [1]

Kausar, Sumaira [2]

Javed, Yunous [2]

Affiliations:

[1] NUST, MCS, Islamabad, Pakistan

[2] NUST, Coll E&ME, Islamabad, Pakistan

Source:

INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE | 2015, Vol. 29, No. 01

Keywords:

Text extraction; image retrieval; caption text; document analysis; ICDAR 2013; VIDEO; IMAGES; EXTRACTION; SEGMENTATION; LOCALIZATION; RECOGNITION; CHARACTERS; ALGORITHM

DOI:

10.1142/S0218001415550034

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Textual information embedded in multimedia can provide a vital tool for indexing and retrieval. The text extraction process has many inherent problems due to variation in font size, color, background and resolution. Text detection and localization are the most challenging phases of text extraction, and the extraction results depend heavily on them. This paper focuses on text localization because of its fundamental importance. Two feature vectors are introduced for classifying text and nontext objects. The first feature vector is the Radon transform of the text candidate objects. The second is derived from a detailed geometrical analysis of the text contents. The union of the two feature vectors is used to classify text and nontext objects with a support vector machine (SVM). Text detection and localization results are evaluated on two publicly available datasets, ICDAR 2013 and IPC-Artificial text. The results are also compared with state-of-the-art techniques, and the comparison demonstrates the superiority of the presented research.
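
The feature construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the projection angles, the bin count, or which geometric measurements are used, so the four angles, five bins, and the two geometric features below (bounding-box aspect ratio and fill ratio) are assumptions chosen for illustration, and all function names are hypothetical. The Radon transform is approximated by histogramming foreground pixels over the signed distance s = x cos θ + y sin θ, and the SVM training step is omitted.

```python
import math


def radon_projection(mask, theta_deg, n_bins):
    """Approximate one Radon projection of a binary mask.

    Foreground pixels are binned by their signed distance
    s = x*cos(theta) + y*sin(theta) from the mask centre, and the
    histogram is normalised so that it sums to 1.
    """
    h, w = len(mask), len(mask[0])
    cx, cy = (w - 1) / 2.0, (h - 1) / 2.0
    radius = math.hypot(w, h) / 2.0  # upper bound on |s|
    cos_t = math.cos(math.radians(theta_deg))
    sin_t = math.sin(math.radians(theta_deg))
    bins = [0.0] * n_bins
    for y, row in enumerate(mask):
        for x, v in enumerate(row):
            if v:
                s = (x - cx) * cos_t + (y - cy) * sin_t
                idx = int((s + radius) / (2.0 * radius) * n_bins)
                bins[min(max(idx, 0), n_bins - 1)] += 1.0
    total = sum(bins)
    return [b / total for b in bins] if total else bins


def geometric_features(mask):
    """Assumed geometric features: bounding-box aspect ratio and fill ratio."""
    coords = [(x, y) for y, row in enumerate(mask)
              for x, v in enumerate(row) if v]
    if not coords:
        return [0.0, 0.0]
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    box_w = max(xs) - min(xs) + 1
    box_h = max(ys) - min(ys) + 1
    return [box_w / box_h, len(coords) / (box_w * box_h)]


def feature_vector(mask, angles=(0, 45, 90, 135), n_bins=5):
    """Concatenate the Radon-style projections with the geometric features."""
    feats = []
    for angle in angles:
        feats.extend(radon_projection(mask, angle, n_bins))
    feats.extend(geometric_features(mask))
    return feats


# Example candidate object: a thin horizontal bar, as a caption stroke might be.
bar = [
    [0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0],
]
features = feature_vector(bar)
```

Each candidate object yields one fixed-length vector of this kind; vectors from labelled text and nontext objects would then serve as training input for an SVM classifier.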

Pages: 23

50 entries in total

[41] A retrieval algorithm for specific face images in airport surveillance multimedia videos on cloud computing platform
Zhang, Ning
Jeong, Hwa-Young
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (16) : 17129 - 17143
[42] A retrieval algorithm for specific face images in airport surveillance multimedia videos on cloud computing platform
Ning Zhang
Hwa-Young Jeong
Multimedia Tools and Applications, 2017, 76 : 17129 - 17143
[43] New method for text detection and segmentation from complex images
Liu, Fang
Peng, Xiang
Wang, Tianjiang
MIPPR 2007: AUTOMATIC TARGET RECOGNITION AND IMAGE ANALYSIS; AND MULTISPECTRAL IMAGE ACQUISITION, PTS 1 AND 2, 2007, 6786
[44] A Fast and Accurate Text Detection Method From Complex Images
Ren, Zhiyun
Huang, Linlin
PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 1144 - +
[45] How multimedia shape crowdfunding outcomes: The overshadowing effect of images and videos on text in campaign information
Yang, Jialiang
Li, Yaokuang
Calic, Goran
Shevchenko, Anton
JOURNAL OF BUSINESS RESEARCH, 2020, 117 : 6 - 18
[46] Center block duplication detection and indexing for efficient web information retrieval
Cadenhead, Tyrone
Chen, Jinlin
Cook, Terry
2006 1ST INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT, 2006, : 424 - +
[47] Hierarchical cellular tree: An efficient indexing scheme for content-based retrieval on multimedia databases
Kiranyaz, Serkan
Gabbouj, Moncef
IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 9 (01) : 102 - 119
[48] Techniques and data structures for efficient multimedia retrieval based on similarity
Lu, GJ
IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (03) : 372 - 384
[49] Efficient indexing and retrieval of patient information from the big data using MapReduce framework and optimisation
Merlin, N. R. Gladiss
Prem, M. Vigilson
JOURNAL OF INFORMATION SCIENCE, 2023, 49 (02) : 500 - 518
[50] Automated semantic indexing of imaging reports to support retrieval of medical images in the multimedia electronic medical record
Lowe, HJ
Antipov, I
Hersh, W
Smith, CA
Mailhot, M
METHODS OF INFORMATION IN MEDICINE, 1999, 38 (4-5) : 303 - 307