Automatic Script Identification in the Wild

Authors
Shi, Baoguang [1]
Yao, Cong [1]
Zhang, Chengquan [1]
Guo, Xiaowei [2]
Huang, Feiyue [2]
Bai, Xiang [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch EIC, Wuhan 430074, Peoples R China
[2] Tencent, Shanghai 200233, Peoples R China
Keywords
FRAMEWORK; TEXTURE; IMAGES;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the rapid increase of transnational communication and cooperation, people frequently encounter multilingual scenarios. In this paper, we are concerned with a relatively new problem: script identification at the word or line level in natural scenes. A large-scale dataset with a great quantity of natural images and 10 widely used languages is constructed and released. To address the challenges of script identification in real-world scenarios, a deep learning based algorithm is proposed. Experiments on the proposed dataset demonstrate that our algorithm achieves superior performance compared with conventional image classification and script identification methods, including the original CNN architecture, LLC and GLCM.
Pages: 531 - 535
Number of pages: 5