Cloud of Line Distribution and Random Forest Based Text Detection from Natural/Video Scene Images

被引:0
|
作者
Wang, Wenhai [1 ]
Wu, Yirui [2 ]
Shivakumara, Palaiahnakote [3 ]
Lu, Tong [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
[2] Hohai Univ, Coll Comp & Informat, Nanjing, Jiangsu, Peoples R China
[3] Univ Malaya, Dept Comp Syst & Informat Technol, Kuala Lumpur, Malaysia
来源
关键词
COLD; Random forest; Text detection in natural scene image; Text detection in video image;
D O I
10.1007/978-3-319-73600-6_5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text detection in natural and video scene images is still considered to be challenging due to unpredictable nature of scene texts. This paper presents a new method based on Cloud of Line Distribution (COLD) and Random Forest Classifier for text detection in both natural and video images. The proposed method extracts unique shapes of text components by studying the relationship between dominant points such as straight or cursive over contours of text components, which is called COLD in polar domain. We consider edge components as text candidates if the edge components in Canny and Sobel of an input image share the COLD property. For each text candidate, we further study its COLD distribution at component level to extract statistical features and angle oriented features. Next, these features are fed to a random forest classifier to eliminate false text candidates, which results representatives. We then perform grouping using representatives to form text lines based on the distances between edge components in the edge image. The statistical and angle orientated features are finally extracted at word level for eliminating false positives, which results in text detection. The proposed method is tested on standard database, namely, SVT, ICDAR 2015 scene, ICDAR2013 scene and video databases, to show its effectiveness and usefulness compared with the existing methods.
引用
收藏
页码:48 / 60
页数:13
相关论文
共 50 条
  • [41] Deep learning for detection of text polarity in natural scene images
    Perepu, Pavan Kumar
    NEUROCOMPUTING, 2021, 431 : 1 - 6
  • [42] Text Detection in Natural Scene Images Leveraging Context Information
    Wang, Runmin
    Sang, Nong
    Gao, Changxin
    Kuang, Xiaoqin
    Xiang, Jun
    PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 444 - 454
  • [43] A hierarchical recursive method for text detection in natural scene images
    Xiaobing Wang
    Yonghong Song
    Yuanlin Zhang
    Jingmin Xin
    Multimedia Tools and Applications, 2017, 76 : 26201 - 26223
  • [44] Text Detection in Natural Scene Images by Stroke Gabor Words
    Yi, Chucai
    Tian, Yingli
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 177 - 181
  • [45] A robust algorithm for text region detection in natural scene images
    Park, Jonghyun
    Lee, Gueesang
    CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING-REVUE CANADIENNE DE GENIE ELECTRIQUE ET INFORMATIQUE, 2008, 33 (3-4): : 215 - 222
  • [46] A two level algorithm for text detection in natural scene images
    Rong, Li
    Wang Suyu
    Shi, ZhiXin
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 329 - 333
  • [47] A robust arbitrary text detection system for natural scene images
    Risnumawan, Anhar
    Shivakumara, Palaiahankote
    Chan, Chee Seng
    Tan, Chew Lim
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (18) : 8027 - 8048
  • [48] A Database for Urdu Text Detection and Recognition in Natural Scene Images
    Chandio, Asghar Ali
    Leghari, Mehwish
    Memon, Mukhtiar Ahmed
    Leghari, Mehjabeen
    Jalbani, Akhtar Hussain
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2020, 39 (01) : 47 - 54
  • [49] Urdu text in natural scene images: a new dataset and preliminary text detection
    Ali, Hazrat
    Iqbal, Khalid
    Mujtaba, Ghulam
    Fayyaz, Ahmad
    Bulbul, Mohammad Farhad
    Karam, Fazal Wahab
    Zahir, Ali
    PEERJ COMPUTER SCIENCE, 2021, 7
  • [50] Urdu text in natural scene images: a new dataset and preliminary text detection
    Ali H.
    Iqbal K.
    Mujtaba G.
    Fayyaz A.
    Bulbul M.F.
    Karam F.W.
    Zahir A.
    PeerJ Computer Science, 2021, 7 : 1 - 17