A graph-based solution for writer identification from handwritten text

被引:2
|
作者
Rahman, Atta Ur [1 ]
Halim, Zahid [1 ]
机构
[1] Ghulam Ishaq Khan Inst Engn Sci & Technol, Fac Comp Sci & Engn, Machine Intelligence Res Grp MInG, Topi, Pakistan
关键词
Writer identification; Preprocessing; Graph-based representation; Feature extraction; Ensemble learning; INDIVIDUALITY; CODEBOOK; FEATURES;
D O I
10.1007/s10115-022-01676-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Writer identification is an active research problem due to its applications in forensic and historic documents analysis. It is challenging to identify a writer from her handwritten characters' shapes produced via practiced writing style. Different writing shapes, styles, orientations, various sizes of characters, complex structures, inconsistency, and cursive nature of the text make it a tougher undertaking. To solve this problem, we need to explore a structural representation and spatial information of the handwritten characters. For this, a novel graph-based approach is proposed here to spatially map the handwritten text, adapt its structure, size, and explore the relationship that exist between them. First, image processing steps such as binarization, baseline correction, separation of the writing region, and thinning of the strokes to a width of a single pixel are executed. This work presents a novel algorithm for detecting key points (KPs) in a handwritten skeleton image and extracting their two-dimensional pixel coordinates values. The handwriting samples are then transformed into a graph-based representation with KPs representing nodes and the line segments connecting adjacent KPs as the edges. Features are extracted from the graph-based representations of the handwritten text. For classification, ensemble learning approaches are employed. Four benchmark datasets and one custom collected dataset are utilized for experimentations. The proposed solution achieves identification accuracies of 98.26%, 98.84%, 99.67%, 98.51%, and 97.73%, on CERUG-EN, CVL, Firemaker, IAM, and custom datasets, respectively.
引用
收藏
页码:1501 / 1523
页数:23
相关论文
共 50 条
  • [41] EdgeSumm: Graph-based framework for automatic text summarization
    El-Kassas, Wafaa S.
    Salama, Cherif R.
    Rafea, Ahmed A.
    Mohamed, Hoda K.
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [42] An Orthographic Similarity Measure for Graph-Based Text Representations
    Deforche, Maxime
    De Vos, Ilse
    Bronselaer, Antoon
    De Tre, Guy
    FLEXIBLE QUERY ANSWERING SYSTEMS, FQAS 2023, 2023, 14113 : 206 - 218
  • [43] Graph-Based Text Summarization Using Modified TextRank
    Mallick, Chirantana
    Das, Ajit Kumar
    Dutta, Madhurima
    Das, Asit Kumar
    Sarkar, Apurba
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 137 - 146
  • [44] Graph-Based Approach for Cross Domain Text Linking
    Hu, Yu
    Nie, Tiezheng
    Shen, Derong
    Kou, Yue
    WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2015 WORKSHOPS, 2015, 9461 : 151 - 160
  • [45] Knowledge Graph-based Algorithm for Text Data Mining
    Zhao, Yu-Feng
    He, Jie
    Journal of Network Intelligence, 2024, 9 (03): : 1892 - 1906
  • [46] Knowledge Graph-Based Hierarchical Text Semantic Representation
    Wu, Yongliang
    Pan, Xiao
    Li, Jinghui
    Dou, Shimao
    Dong, Jiahao
    Wei, Dan
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024
  • [47] Graph-Based Semantic Learning, Representation and Growth from Text: A Systematic Review
    Ali, Ismael
    Melton, Austin
    2019 13TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2019, : 118 - 123
  • [48] A graph-based method for reconstructing entities from coordination ellipsis in medical text
    Yuan, Chi
    Wang, Yongli
    Shang, Ning
    Li, Ziran
    Zhao, Ruxin
    Weng, Chunhua
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (09) : 1364 - 1373
  • [49] Unsupervised writer adaptation applied to handwritten text recognition
    Nosary, A
    Heutte, L
    Paquet, T
    PATTERN RECOGNITION, 2004, 37 (02) : 385 - 388
  • [50] Graph-based Turkish text normalization and its impact on noisy text processing
    Demir, Seniz
    Topcu, Berkay
    ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2022, 35