Efficient skew detection of printed document images based on novel combination of enhanced profiles

Cited by: 11
Authors
Papandreou, A. [1, 2]
Gatos, B. [2]
Perantonis, S. J. [2]
Gerardis, I. [2]
Affiliations
[1] Univ Athens, Dept Informat & Telecommun, Athens 15784, Greece
[2] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Athens 15310, Greece
Keywords
Document skew correction; Projection profiles; Document image preprocessing
DOI
10.1007/s10032-014-0228-5
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Document skew is often introduced during the capturing process of the document image processing pipeline and may seriously affect the performance of subsequent stages of segmentation and recognition. Skew detection is often accomplished with the use of horizontal projections, while recently, a new approach that is based on vertical projections has been introduced. In this paper, we use the technique of minimum bounding box area in order to combine a horizontal with a new reinforced vertical projection profile method. We are motivated by the fact that the horizontal and the novel vertical projection profiles are found to be complementary to each other. We claim that the proposed approach has more accurate performance compared with other state-of-the-art skew detection algorithms; it deals with all the drawbacks of the projection profile methods; it is more noise and warp resistant and gives accurate results for any kind of printed document image. For these reasons, it can be efficiently applied to historical machine printed or multicolumn documents, documents with figures and tables, while it is robust for any kind of script. Extended experimental results on two databases in different skew angle range, with representative printed documents of all kinds, as well as printed documents of two historical books, prove the efficiency of the proposed approach. There is also a comparison with commercial products in several cases where the contribution of the proposed algorithm is demonstrated at optical character recognition level. Moreover, an analysis of the accuracy performance of the main elements of the proposed technique is also performed.
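To make the two building blocks named in the abstract concrete, here is a minimal sketch of a classic horizontal projection profile skew estimator together with a bounding box area measure. It is not the paper's combined method: the enhanced vertical profile and the exact way the authors combine the two profiles are not described in this record. The synthetic page, the function names, the sum-of-squares profile score, the 0.5 degree search step, and the sign convention for the angle are all assumptions made for illustration.

```python
import math
from collections import Counter


def make_skewed_page(angle_deg, width=200, height=200):
    """Hypothetical test data: foreground pixels of 4-px-thick 'text lines',
    rotated by angle_deg about the page centre. Returns a list of (x, y)."""
    theta = math.radians(angle_deg)
    c, s = math.cos(theta), math.sin(theta)
    cx, cy = width / 2, height / 2
    points = []
    for top in range(30, height - 30, 12):      # one text line every 12 px
        for dy in range(4):                     # line thickness
            for x in range(30, width - 30):     # line length
                px, py = x - cx, top + dy - cy
                points.append((cx + px * c - py * s, cy + px * s + py * c))
    return points


def rotate_back(points, angle_deg):
    """Rotate all points by -angle_deg about the origin (candidate deskew)."""
    theta = math.radians(angle_deg)
    c, s = math.cos(theta), math.sin(theta)
    return [(x * c + y * s, -x * s + y * c) for x, y in points]


def profile_score(points, angle_deg):
    """Sum of squared row counts of the horizontal projection profile.
    The profile is sharpest (score highest) when text lines are level."""
    rows = Counter(math.floor(y) for _, y in rotate_back(points, angle_deg))
    return sum(n * n for n in rows.values())


def estimate_skew(points, lo=-5.0, hi=5.0, step=0.5):
    """Brute-force search over candidate angles in [lo, hi]."""
    n = int(round((hi - lo) / step)) + 1
    candidates = [lo + i * step for i in range(n)]
    return max(candidates, key=lambda a: profile_score(points, a))


def bounding_box_area(points, angle_deg):
    """Area of the axis-aligned box around the content after deskewing
    by angle_deg; for a rectangular text block it is smallest when level."""
    rotated = rotate_back(points, angle_deg)
    xs = [x for x, _ in rotated]
    ys = [y for _, y in rotated]
    return (max(xs) - min(xs)) * (max(ys) - min(ys))


if __name__ == "__main__":
    page = make_skewed_page(3.0)
    angle = estimate_skew(page)
    print("estimated skew (degrees):", angle)
    print("bounding box area at estimate:", bounding_box_area(page, angle))
```

One plausible use of the area measure, again an assumption rather than the procedure in the paper, is to compare skew estimates produced by different profile methods and keep the one that yields the smaller bounding box.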
Pages: 433-454
Page count: 22
Related papers
50 in total
  • [1] Efficient skew detection of printed document images based on novel combination of enhanced profiles
    A. Papandreou
    B. Gatos
    S. J. Perantonis
    I. Gerardis
    International Journal on Document Analysis and Recognition (IJDAR), 2014, 17 : 433 - 454
  • [2] Skew detection and reconstruction of color-printed document images
    Chen, YK
    Wang, JF
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (08): : 1018 - 1024
  • [3] Skew detection and reconstruction of color-printed document images
    Chen, Yi-Kai
    Wang, Jhing-Fa
    IEICE Transactions on Information and Systems, 2001, E84-D (08) : 1018 - 1024
  • [4] Skew detection, page segmentation, and script classification of printed document images
    Waked, B
    Bergler, S
    Suen, CY
    Khoury, S
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 4470 - 4475
  • [5] Skew detection in document images based on rectangular active contour
    Huijie Fan
    Linlin Zhu
    Yandong Tang
    International Journal on Document Analysis and Recognition (IJDAR), 2010, 13 : 261 - 269
  • [6] Skew detection in document images based on rectangular active contour
    Fan, Huijie
    Zhu, Linlin
    Tang, Yandong
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2010, 13 (04) : 261 - 269
  • [7] DOCUMENT SKEW DETECTION - A NOVEL APPROACH
    Manjunath, A. V. N.
    Hemantha, K. G.
    Noushath, S.
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2008, 8 (01) : 47 - 59
  • [8] New Fast Content Based Skew Detection Algorithm for Document Images
    Amir, Mohd
    Jindal, Abhishek
    PATTERN RECOGNITION AND INFORMATION PROCESSING, 2017, 673 : 36 - 43
  • [9] Efficient skew estimation and correction algorithm for document images
    Kwag, HK
    Kim, SH
    Jeong, SH
    Lee, GS
    IMAGE AND VISION COMPUTING, 2002, 20 (01) : 25 - 35
  • [10] Skew detection and correction in document images based on straight-line fitting
    Cao, Y
    Wang, SH
    Li, H
    PATTERN RECOGNITION LETTERS, 2003, 24 (12) : 1871 - 1879