Automatic Script Identification in the Wild

Authors
Shi, Baoguang [1]
Yao, Cong [1]
Zhang, Chengquan [1]
Guo, Xiaowei [2]
Huang, Feiyue [2]
Bai, Xiang [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch EIC, Wuhan 430074, Peoples R China
[2] Tencent, Shanghai 200233, Peoples R China
Keywords
FRAMEWORK; TEXTURE; IMAGES
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the rapid growth of transnational communication and cooperation, people frequently encounter multilingual scenarios. In this paper, we address a relatively new problem: script identification at the word or line level in natural scenes. We construct and release a large-scale dataset comprising a large number of natural images and covering 10 widely used languages. To address the challenges of script identification in real-world scenarios, we propose a deep-learning-based algorithm. Experiments on the proposed dataset demonstrate that our algorithm outperforms conventional image classification and script identification methods, including the original CNN architecture, LLC, and GLCM.
Pages: 531-535
Number of pages: 5