Automatic script identification from document images using cluster-based templates

被引:112
|
作者
Hochberg, J
Kelly, P
Thomas, T
Kerns, L
机构
关键词
script identification; document analysis; optical character recognition;
D O I
10.1109/34.574802
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe an automated script identification system for typeset document images. Templates for each script are created by clustering textual symbols from a training set. Symbols from new images are compared to the templates to find the best script. Our current system processes thirteen scripts with minimal preprocessing and high accuracy.
引用
收藏
页码:176 / 181
页数:6
相关论文
共 50 条
  • [41] Local features-based script recognition from printed bilingual document images
    Abirami, S.
    Manjula, D.
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2010, 38 (04) : 283 - 297
  • [42] CLUSTER-BASED SEGMENTATION OF RANGE IMAGES USING DIFFERENTIAL-GEOMETRIC FEATURES
    KRISHNAPURAM, R
    MUNSHI, A
    OPTICAL ENGINEERING, 1991, 30 (10) : 1468 - 1478
  • [43] Script Identification of Multilingual Document Images Based on Block Finite Ridgelet Transform and Discrete Curvelet Transform
    Wu, Zheng-Jian
    Hasimu, Reyihanguli
    Mamat, Hoinisa
    Aysa, Alimjan
    Ubul, Kurban
    PROCEEDINGS OF 2020 2ND INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MACHINE VISION AND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND MACHINE LEARNING, IPMV 2020, 2020, : 87 - 93
  • [44] Script-Independent Text Segmentation from Document Images
    Sahare P.
    Tembhurne J.V.
    Parate M.R.
    Diwan T.
    Dhok S.B.
    International Journal of Ambient Computing and Intelligence, 2022, 13 (01)
  • [45] Interactive Cluster-Based Personalized Retrieval on Large Document Collections
    Belsis, Petros
    Konstantopoulos, Charalampos
    Mamalis, Basilis
    Pantzioul, Grarnmati
    Skourlas, Christos
    NEW DIRECTIONS IN INTELLIGENT INTERACTIVE MULTIMEDIA, 2008, 142 : 211 - +
  • [46] Hybrid Indexing for Versioned Document Search with Cluster-based Retrieval
    Jin, Xin
    Agun, Daniel
    Yang, Tao
    Wu, Qinghao
    Shen, Yifan
    Zhao, Susen
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 377 - 386
  • [47] A Fuzzy Cluster-based Algorithm for Peptide Identification
    Liang, Xijun
    Xia, Zhonghang
    Niu, Xinnan
    Link, Andrew J.
    Pang, Liping
    Wu, Fangxiang
    Zhang, Hongwei
    2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [48] CLUE: Cluster-based retrieval of images by unsupervised learning
    Chen, YX
    Wang, JZ
    Krovetz, R
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2005, 14 (08) : 1187 - 1201
  • [49] Query Medical Images by Cluster-Based Texture Matching
    Yang Wu
    Xu Hui
    Guo Hongxing
    Liao Mengyang(College of Electronic information
    Wuhan University Journal of Natural Sciences, 1998, (04) : 461 - 463
  • [50] Cluster-based deep convolutional networks for spectral reconstruction from RGB images
    Zou, Changzhong
    Wei, Minghui
    NEUROCOMPUTING, 2021, 464 (464) : 342 - 351