Connecting the Dots: What Graph-Based Text Representations Work Best for Text Classification using Graph Neural Networks?

被引:0
|
作者
Bugueno, Margarita [1 ]
de Melo, Gerard [1 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst HPI, Potsdam, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the success of Graph Neural Networks (GNNs) for structure-aware machine learning, many studies have explored their use for text classification, but mostly in specific domains with limited data characteristics. Moreover, some strategies prior to GNNs relied on graph mining and classical machine learning, making it difficult to assess their effectiveness in modern settings. This work extensively investigates graph representation methods for text classification, identifying practical implications and open challenges. We compare different graph construction schemes using a variety of GNN architectures and setups across five datasets, encompassing short and long documents as well as unbalanced scenarios in diverse domains. Two Transformer-based large language models are also included to complement the study. The results show that i) although the effectiveness of graphs depends on the textual input features and domain, simple graph constructions perform better the longer the documents are, ii) graph representations are especially beneficial for longer documents, outperforming Transformer-based models, iii) graph methods are particularly efficient at solving the task.
引用
收藏
页码:8943 / 8960
页数:18
相关论文
共 50 条
  • [1] Software bug prediction using graph neural networks and graph-based text representations
    Siachos, Ilias
    Kanakaris, Nikos
    Karacapilidis, Nikos
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [2] Bipartite Graph Coarsening for Text Classification Using Graph Neural Networks
    dos Santos, Nicolas Roque
    Minatel, Diego
    Baria Valejo, Alan Demetrius
    Lopes, Alneu de A.
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 589 - 604
  • [3] Multi-view Graph-Based Text Representations for Imbalanced Classification
    Karajeh, Ola
    Lourentzou, Ismini
    Fox, Edward A.
    LINKING THEORY AND PRACTICE OF DIGITAL LIBRARIES, TPDL 2023, 2023, 14241 : 249 - 264
  • [4] Graph-based Text Classification by Contrastive Learning with Text-level Graph Augmentation
    Li, Ximing
    Wang, Bing
    Wang, Yang
    Wang, Meng
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)
  • [5] A Graph-Based Measurement for Text Imbalance Classification
    Tian, Jiachen
    Chen, Shizhan
    Zhang, Xiaowang
    Feng, Zhiyong
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2188 - 2195
  • [6] Recurrent Graph Neural Networks for Text Classification
    Wei, Xinde
    Huang, Hai
    Ma, Longxuan
    Yang, Ze
    Xu, Liutong
    PROCEEDINGS OF 2020 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2020), 2020, : 91 - 97
  • [7] Graph neural networks for text classification: a survey
    Wang, Kunze
    Ding, Yihao
    Han, Soyeon Caren
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (08)
  • [8] An Orthographic Similarity Measure for Graph-Based Text Representations
    Deforche, Maxime
    De Vos, Ilse
    Bronselaer, Antoon
    De Tre, Guy
    FLEXIBLE QUERY ANSWERING SYSTEMS, FQAS 2023, 2023, 14113 : 206 - 218
  • [9] Review of Text Classification Methods Based on Graph Neural Networks
    Su, Yilei
    Li, Weijun
    Liu, Xueyang
    Ding, Jianping
    Liu, Shixia
    Li, Haonan
    Li, Guanfeng
    Computer Engineering and Applications, 2024, 60 (19) : 1 - 17
  • [10] KGAT: An Enhanced Graph-Based Model for Text Classification
    Wang, Xin
    Wang, Chao
    Yang, Haiyang
    Zhang, Xingpeng
    Shen, Qi
    Ji, Kan
    Wu, Yuhong
    Zhan, Huayi
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I, 2022, 13551 : 656 - 668