Connecting the Dots: What Graph-Based Text Representations Work Best for Text Classification using Graph Neural Networks?

被引:0
|
作者
Bugueno, Margarita [1 ]
de Melo, Gerard [1 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst HPI, Potsdam, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the success of Graph Neural Networks (GNNs) for structure-aware machine learning, many studies have explored their use for text classification, but mostly in specific domains with limited data characteristics. Moreover, some strategies prior to GNNs relied on graph mining and classical machine learning, making it difficult to assess their effectiveness in modern settings. This work extensively investigates graph representation methods for text classification, identifying practical implications and open challenges. We compare different graph construction schemes using a variety of GNN architectures and setups across five datasets, encompassing short and long documents as well as unbalanced scenarios in diverse domains. Two Transformer-based large language models are also included to complement the study. The results show that i) although the effectiveness of graphs depends on the textual input features and domain, simple graph constructions perform better the longer the documents are, ii) graph representations are especially beneficial for longer documents, outperforming Transformer-based models, iii) graph methods are particularly efficient at solving the task.
引用
收藏
页码:8943 / 8960
页数:18
相关论文
共 50 条
  • [21] Domain-Adversarial Graph Neural Networks for Text Classification
    Wu, Man
    Pan, Shirui
    Zhu, Xingquan
    Zhou, Chuan
    Pan, Lei
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 648 - 657
  • [22] Graph-Based Text Summarization Using Modified TextRank
    Mallick, Chirantana
    Das, Ajit Kumar
    Dutta, Madhurima
    Das, Asit Kumar
    Sarkar, Apurba
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 137 - 146
  • [23] Graph-based Text Representation for Malay Translated Hadith Text
    Alias, Nursyahidah
    Abd Rahman, Nurazzah
    Ismail, Normaly Kamal
    Nor, Zulhilmi Mohamed
    Alias, Muhammad Nazir
    2016 THIRD INTERNATIONAL CONFERENCE ON INFORMATION RETRIEVAL AND KNOWLEDGE MANAGEMENT (CAMP), 2016, : 60 - 66
  • [24] Graph-based extractive text summarization method for Hausa text
    Bichi, Abdulkadir Abubakar
    Samsudin, Ruhaidah
    Hassan, Rohayanti
    Hasan, Layla Rasheed Abdallah
    Rogo, Abubakar Ado
    PLOS ONE, 2023, 18 (05):
  • [25] Graph-Based Audio Classification Using Pre-Trained Models and Graph Neural Networks
    Castro-Ospina, Andres Eduardo
    Solarte-Sanchez, Miguel Angel
    Vega-Escobar, Laura Stella
    Isaza, Claudia
    Martinez-Vargas, Juan David
    SENSORS, 2024, 24 (07)
  • [26] Supervised Graph-Based Term Weighting Scheme for Effective Text Classification
    Shanavas, Niloofer
    Wang, Hui
    Lin, Zhiwei
    Hawe, Glenn
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1710 - 1711
  • [27] Review of Graph Neural Network in Text Classification
    Malekzadeh, Masoud
    Hajibabaee, Parisa
    Heidari, Maryam
    Zad, Samira
    Uzuner, Ozlem
    Jones, James H. Jr Jr
    2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 84 - 91
  • [28] Knowledge-enhanced graph convolutional neural networks for text classification
    Wang T.
    Zhu X.-F.
    Tang G.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (02): : 322 - 328
  • [29] A Novel Graph Neural Network Based Model for Text Classification
    Xiong, Rui
    Zheng, Hongying
    Wang, Zongbing
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT VII, 2024, 15022 : 64 - 78
  • [30] Extractive Text Summarization Using Ontology and Graph-Based Method
    Yongkiatpanich, Chuleepohn
    Wichadakul, Duangdao
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), 2019, : 105 - 110