Improved Ring Radius Transform-Based Reconstruction for Video Character Recognition

被引:3
|
作者
Huang, Zhiheng [1 ]
Shivakumara, Palaiahnakote [2 ]
Lu, Tong [1 ]
Pal, Umapada [3 ]
Blumenstein, Michael [4 ]
Chetty, Bhaarat [5 ,6 ]
Kumar, G. Hemantha [7 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[3] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata, India
[4] Univ Technol Sydney, Australian Artificial Intelligence Inst, Sydney, NSW, Australia
[5] Google Developers Grp, Bangalore, Karnataka, India
[6] NASDAQ, Bangalore, Karnataka, India
[7] Univ Mysore, Dept Studies Comp Sci, Mysore, Karnataka, India
关键词
Video character recognition; reconstruction; ring radius transform;
D O I
10.1142/S0218001421500233
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Character shape reconstruction in video is challenging due to low contrast, complex backgrounds and arbitrary orientation of characters. This work proposes an Improved Ring Radius Transform (IRRT) for reconstructing impaired characters through medial axis prediction. At first, the technique proposes a novel idea based on the Tangent Vector (TV) concept that identifies each actual pair of end pixels caused by gaps in impaired character components. Next, the actual direction to predict medial axis pixels using IRRT for each pair of end pixels is proposed with a new normal vector concept. The process of prediction repeats iteratively to find all the medial axis pixels for every gap in question. Further, medial axis pixels with their radii are used to reconstruct the shapes of impaired characters. The proposed technique is tested on benchmark datasets consisting of video, natural scenes, objects and multi-lingual data to demonstrate that it reconstructs shapes well, even for heterogeneous data. Comparative studies with different binarization and character recognition methods show that the proposed technique is effective, useful and outperforms existing methods.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Wavelet packet transform-based robust video watermarking technique
    GAURAV BHATNAGAR
    BALASUBRMANIAN RAMAN
    Sadhana, 2012, 37 : 371 - 388
  • [22] Hough transform-based cubic spline recognition for natural shapes
    Tung, Cheng-Huang
    Syu, Wei-Jyun
    Huang, Wei-Cheng
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 18 (02) : 164 - 175
  • [23] WAVELET TRANSFORM-BASED CORRELATOR FOR THE RECOGNITION OF ROTATIONALLY DISTORTED IMAGES
    AHMED, F
    KARIM, MA
    ALAM, MS
    OPTICAL ENGINEERING, 1995, 34 (11) : 3187 - 3192
  • [24] Towards Discrete Wavelet Transform-based Human Activity Recognition
    Khare, Manish
    Jeon, Moongu
    SECOND INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2017, 10443
  • [25] Wavelet transform-based correlator for the recognition of rotationally distorted images
    Univ. of Dayton, Kalamazoo, United States
    Opt Eng, 11 (3187-3192):
  • [26] A wavelet transform-based method for improved modeling of transmission lines
    Abur, A
    Ozgun, O
    Magnago, FH
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2003, 18 (04) : 1432 - 1438
  • [27] Spherical Coordinates Transform-Based Motion Model for Panoramic Video Coding
    Wang, Yefei
    Liu, Dong
    Ma, Siwei
    Wu, Feng
    Gao, Wen
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2019, 9 (01) : 98 - 109
  • [28] PARALLEL IMPLEMENTATION OF TRANSFORM-BASED DCT FILTER BANK FOR VIDEO COMMUNICATIONS
    CHIU, CT
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1994, 40 (03) : 473 - 475
  • [29] Video-based handwritten character recognition
    Tang, XO
    Lin, F
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 3748 - 3751
  • [30] Algorithm of a Perspective Transform-Based PDF417 Barcode Recognition
    Young Jung Kim
    Jong Yun Lee
    Wireless Personal Communications, 2016, 89 : 893 - 911