Text detection in images using sparse representation with discriminative dictionaries

Cited by: 78
Authors
Zhao, Ming [1]
Li, Shutao [1]
Kwok, James [2]
Affiliations
[1] Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Hunan, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Text detection; Sparse representation; Discriminative dictionary; SEGMENTATION; EXTRACTION; RECOGNITION; PERFORMANCE;
DOI
10.1016/j.imavis.2010.04.002
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Text detection is important in the retrieval of texts from digital pictures, video databases and webpages. However, it can be very challenging since the text is often embedded in a complex background. In this paper, we propose a classification-based algorithm for text detection using a sparse representation with discriminative dictionaries. First, the edges are detected by the wavelet transform and scanned into patches by a sliding window. Then, candidate text areas are obtained by applying a simple classification procedure using two learned discriminative dictionaries. Finally, the adaptive run-length smoothing algorithm and projection profile analysis are used to further refine the candidate text areas. The proposed method is evaluated on the Microsoft common test set, the ICDAR 2003 text locating set, and an image set collected from the web. Extensive experiments show that the proposed method can effectively detect texts of various sizes, fonts and colors from images and videos. (c) 2010 Elsevier B.V. All rights reserved.
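The classification step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the two dictionaries are learned or what decision rule is used, so everything below is an assumption made for illustration: a patch is sparse-coded over each dictionary with a small greedy orthogonal matching pursuit, and it is assigned to the class whose dictionary leaves the smaller reconstruction residual. The names `omp`, `classify_patch`, `D_text`, `D_background`, and the sparsity level `k` are hypothetical and do not come from the paper.

```python
import numpy as np


def omp(D, x, k):
    """Greedy orthogonal matching pursuit (illustrative, not the paper's solver).

    Approximates x with at most k columns (atoms) of D, which are assumed
    to have unit norm, and returns the final residual vector.
    """
    residual = x.astype(float).copy()
    support = []
    for _ in range(k):
        # Pick the atom most correlated with the current residual.
        idx = int(np.argmax(np.abs(D.T @ residual)))
        if idx in support:
            break  # no new atom helps; stop early
        support.append(idx)
        # Refit all selected atoms jointly by least squares.
        coef, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ coef
    return residual


def classify_patch(x, D_text, D_background, k=3):
    """Assumed decision rule: the dictionary with the smaller residual wins."""
    r_text = np.linalg.norm(omp(D_text, x, k))
    r_background = np.linalg.norm(omp(D_background, x, k))
    return "text" if r_text < r_background else "background"
```

In this sketch a patch is a flattened vector of edge values from one sliding-window position, and each dictionary is a matrix whose columns are atoms. A dictionary trained on text patches should reconstruct text-like patches with a smaller error than one trained on background patches, which is the intuition behind comparing residuals.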
Pages: 1590-1599
Number of pages: 10