Improved Ring Radius Transform-Based Reconstruction for Video Character Recognition

Cited by: 3

Authors:

Huang, Zhiheng [1]

Shivakumara, Palaiahnakote [2]

Lu, Tong [1]

Pal, Umapada [3]

Blumenstein, Michael [4]

Chetty, Bhaarat [5,6]

Kumar, G. Hemantha [7]

Affiliations:

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China

[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia

[3] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata, India

[4] Univ Technol Sydney, Australian Artificial Intelligence Inst, Sydney, NSW, Australia

[5] Google Developers Grp, Bangalore, Karnataka, India

[6] NASDAQ, Bangalore, Karnataka, India

[7] Univ Mysore, Dept Studies Comp Sci, Mysore, Karnataka, India

Source:

INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE | 2021, Vol. 35, No. 07

Keywords:

Video character recognition; reconstruction; ring radius transform

DOI:

10.1142/S0218001421500233

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Character shape reconstruction in video is challenging due to low contrast, complex backgrounds and arbitrary orientation of characters. This work proposes an Improved Ring Radius Transform (IRRT) for reconstructing impaired characters through medial axis prediction. At first, the technique proposes a novel idea based on the Tangent Vector (TV) concept that identifies each actual pair of end pixels caused by gaps in impaired character components. Next, the actual direction to predict medial axis pixels using IRRT for each pair of end pixels is proposed with a new normal vector concept. The process of prediction repeats iteratively to find all the medial axis pixels for every gap in question. Further, medial axis pixels with their radii are used to reconstruct the shapes of impaired characters. The proposed technique is tested on benchmark datasets consisting of video, natural scenes, objects and multi-lingual data to demonstrate that it reconstructs shapes well, even for heterogeneous data. Comparative studies with different binarization and character recognition methods show that the proposed technique is effective, useful and outperforms existing methods.
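The final step described in the abstract, rebuilding a shape from medial axis pixels and their radii, can be illustrated with a minimal sketch. This is not the authors' IRRT: the function names `interpolate_gap` and `reconstruct_from_medial_axis` are hypothetical, and the gap between two end pixels is filled by simple linear interpolation as a stand-in assumption for the paper's tangent-vector and normal-vector prediction, which the abstract does not specify in detail.

```python
def interpolate_gap(end_a, end_b, radius_a, radius_b):
    """Fill a gap between two end pixels with axis pixels and radii.

    Assumption: straight-line interpolation, used here only as a
    placeholder for the paper's iterative medial axis prediction.
    """
    (r0, c0), (r1, c1) = end_a, end_b
    steps = max(abs(r1 - r0), abs(c1 - c0))
    pixels, radii = [], []
    for i in range(steps + 1):
        t = i / steps if steps else 0.0
        pixels.append((round(r0 + t * (r1 - r0)), round(c0 + t * (c1 - c0))))
        radii.append(radius_a + t * (radius_b - radius_a))
    return pixels, radii


def reconstruct_from_medial_axis(axis_pixels, radii, height, width):
    """Paint a filled disc of the given radius around each axis pixel."""
    mask = [[0] * width for _ in range(height)]
    for (row, col), radius in zip(axis_pixels, radii):
        r = int(radius)
        # Only visit the bounding box of the disc, clipped to the image.
        for y in range(max(0, row - r), min(height, row + r + 1)):
            for x in range(max(0, col - r), min(width, col + r + 1)):
                if (y - row) ** 2 + (x - col) ** 2 <= radius ** 2:
                    mask[y][x] = 1
    return mask


# Usage: bridge a horizontal gap between two end pixels with stroke radius 1.
pixels, radii = interpolate_gap((2, 2), (2, 6), 1, 1)
mask = reconstruct_from_medial_axis(pixels, radii, height=5, width=9)
```

The union of discs is the standard way to invert a medial axis representation, so the quality of the result depends entirely on how well the axis pixels and radii were predicted for each gap.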

Pages: 24

