SGFNet: A semantic graph-based multimodal network for financial invoice information extraction

Authors
Luo, Shun [1]
Yu, Juan [1]
Affiliations
[1] Fuzhou Univ, Sch Econ & Management, 2 Wulongjiang North Ave, Fuzhou 350108, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Deep learning; Invoice information extraction; Semantic graph; Multimodal modeling
DOI
10.1016/j.eswa.2024.125156
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
To meet the financial industry's demand for large-volume invoice entry and to improve on the low accuracy of traditional manual entry, we construct SGFNet, a financial invoice information extraction network that integrates semantic graph associations and multimodal modeling. First, we construct a graph of strong and weak semantic associations between data within each modality based on the correlation of text content. Subsequently, we model the multimodal data in a unified structure, extract the text modal information of invoices along with the corresponding image and layout modal information, and guide the fusion and embedding of multimodal data through semantic associations in the graph to produce a richer feature representation. Furthermore, the semantically linked multimodal information is fed into an aggregated multimodal self-attention mechanism to establish effective connections between modalities. Finally, the loss function combines supervised contrastive learning with smoothed Kullback-Leibler divergence, which reduces the accuracy degradation caused by sample imbalance and convergence instability. In our experiments, we achieved F1 scores of 93.71% on the English financial invoice dataset and 96.27% on the Chinese dataset, indicating that the proposed method successfully extracts feature information from different data modalities and achieves satisfactory information extraction results.
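The abstract names the two loss terms but gives no formulas, so the following is only a minimal sketch of how a supervised contrastive term and a label-smoothed Kullback-Leibler term might be combined. It uses a common formulation of each loss, not necessarily the one in SGFNet. The function names, the smoothing factor `eps`, the `temperature`, and the mixing `weight` are all assumptions made for illustration.

```python
import math

def smoothed_targets(label, num_classes, eps=0.1):
    """Assumed smoothing: 1 - eps on the true class, eps spread over the others."""
    off = eps / (num_classes - 1)
    return [1.0 - eps if k == label else off for k in range(num_classes)]

def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q):
    """KL(p || q) for discrete distributions; terms with p_k = 0 contribute 0."""
    return sum(pk * math.log(pk / qk) for pk, qk in zip(p, q) if pk > 0)

def sup_con_loss(embeddings, labels, temperature=0.1):
    """Supervised contrastive loss over L2-normalised embeddings.

    For each anchor, samples sharing its label are positives; anchors
    without any positive are skipped.
    """
    def normalise(v):
        norm = math.sqrt(sum(x * x for x in v))
        return [x / norm for x in v]

    z = [normalise(v) for v in embeddings]
    n = len(z)
    total, anchors = 0.0, 0
    for i in range(n):
        positives = [j for j in range(n) if j != i and labels[j] == labels[i]]
        if not positives:
            continue
        # Scaled cosine similarity between anchor i and every other sample.
        sims = {j: sum(a * b for a, b in zip(z[i], z[j])) / temperature
                for j in range(n) if j != i}
        log_denom = math.log(sum(math.exp(s) for s in sims.values()))
        total += -sum(sims[j] - log_denom for j in positives) / len(positives)
        anchors += 1
    return total / anchors if anchors else 0.0

def combined_loss(logits_batch, embeddings, labels, num_classes, weight=0.5):
    """Hypothetical combination: mean smoothed-KL term plus weighted contrastive term."""
    kl = sum(kl_divergence(smoothed_targets(y, num_classes), softmax(logits))
             for logits, y in zip(logits_batch, labels)) / len(labels)
    return kl + weight * sup_con_loss(embeddings, labels, temperature=0.1)
```

In this sketch the contrastive term pulls same-label embeddings together regardless of how many samples each class has, while the smoothed targets keep the classification term from pushing predictions to hard 0/1 values. This matches the motivation given in the abstract (sample imbalance and convergence instability) but not necessarily the paper's exact mechanism.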
Pages: 9