SGFNet: A semantic graph-based multimodal network for financial invoice information extraction

被引:0
|
作者
Luo, Shun [1 ]
Yu, Juan [1 ]
机构
[1] Fuzhou Univ, Sch Econ & Management, 2 Wulongjiang North Ave, Fuzhou 350108, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Invoice information extraction; Semantic graph; Multimodal modeling;
D O I
10.1016/j.eswa.2024.125156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To meet the demand for a large amount of invoice entry work in the financial industry and improve the low accuracy of traditional manual entry, we construct SGFNet, a financial invoice information extraction network that integrates semantic graph associations and multimodal modeling. First, we construct a graph of strong and weak semantic associations between data within each modality based on the correlation of text content. Subsequently, we model the multimodal data in a unified structure, extract the text modal information of invoices along with corresponding image and layout modal information, and guide the fusion and embedding of multimodal data through semantic associations in the graph to produce a richer feature representation. Furthermore, semantically linked multimodal information is fed into an aggregated multimodal self-attention mechanism to establish effective connection between modalities. Finally, with the combination of supervised contrastive learning and smoothed Kullback-Leibler divergence in terms of loss functions, the accuracy degradation problem incurred by sample imbalance and convergence instability is reduced. In our experiments, we achieved F1 scores of 93.71% for the English financial invoice dataset and 96.27% for the Chinese dataset, indicating that the proposed method successfully extracts feature information from different data modalities, thereby achieving satisfactory information extraction results.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Graph-based Information Modeling for ICPS
    Biskupovic, Angel
    Nunez, Felipe
    2022 IEEE 20TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2022, : 47 - 52
  • [42] Graph-Based Processing of Macromolecular Information
    Munteanu, Cristian R.
    Aguiar-Pulido, Vanessa
    Freire, Ana
    Martinez-Romero, Marcos
    Porto-Pazos, Ana B.
    Pereira, Javier
    Dorado, Julian
    CURRENT BIOINFORMATICS, 2015, 10 (05) : 606 - 631
  • [43] Multimodal Graph-Based Dependency Parsing of Natural Language
    Salama, Amr Rekaby
    Menzel, Wolfgang
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 22 - 31
  • [44] Multimodal Graph-Based Reranking for Web Image Search
    Wang, Meng
    Li, Hao
    Tao, Dacheng
    Lu, Ke
    Wu, Xindong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (11) : 4649 - 4661
  • [45] A semantic graph-based keyword extraction model using ranking method on big social data
    R. Devika
    V. Subramaniyaswamy
    Wireless Networks, 2021, 27 : 5447 - 5459
  • [46] A semantic graph-based keyword extraction model using ranking method on big social data
    Devika, R.
    Subramaniyaswamy, V
    WIRELESS NETWORKS, 2021, 27 (08) : 5447 - 5459
  • [47] Graph-Based Analysis in Network Security
    Collins, M. Patrick
    2011 - MILCOM 2011 MILITARY COMMUNICATIONS CONFERENCE, 2011, : 1333 - 1337
  • [48] GRAPH-BASED KINSHIP REASONING NETWORK
    Li, Wanhua
    Zhang, Yingqiang
    Lv, Kangchen
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [49] Graph-based network analysis in schizophrenia
    Micheloyannis, Sifis
    WORLD JOURNAL OF PSYCHIATRY, 2012, 2 (01): : 1 - 12
  • [50] AGMN: Association graph-based graph matching network for coronary artery semantic labeling on invasive coronary angiograms
    Zhao, Chen
    Xu, Zhihui
    Jiang, Jingfeng
    Esposito, Michele
    Pienta, Drew
    Hung, Guang-Uei
    Zhou, Weihua
    PATTERN RECOGNITION, 2023, 143