Cross-lingual Text Classification with Heterogeneous Graph Neural Network

被引:0
|
作者
Wang, Ziyun [1 ]
Liu, Xuan [1 ]
Yang, Peiji [1 ]
Liu, Shixing [1 ]
Wang, Zhisheng [1 ]
机构
[1] Tencent, Shenzhen, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-lingual text classification aims at training a classifier on the source language and transferring the knowledge to target languages, which is very useful for low-resource languages. Recent multilingual pretrained language models (mPLM) achieve impressive results in cross-lingual classification tasks, but rarely consider factors beyond semantic similarity, causing performance degradation between some language pairs. In this paper we propose a simple yet effective method to incorporate heterogeneous information within and across languages for cross-lingual text classification using graph convolutional networks (GCN). In particular, we construct a heterogeneous graph by treating documents and words as nodes, and linking nodes with different relations, which include part-of-speech roles, semantic similarity, and document translations. Extensive experiments show that our graph-based method significantly outperforms state-of-the-art models on all tasks, and also achieves consistent performance gain over baselines in low-resource settings where external tools like translators are unavailable.
引用
收藏
页码:612 / 620
页数:9
相关论文
共 50 条
  • [1] An Integrated Topic Modelling and Graph Neural Network for Improving Cross-lingual Text Classification
    Tham Vo
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)
  • [2] Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous Graph
    Chairatanakul, Nuttapong
    Sriwatanasakdi, Noppayut
    Charoenphakdee, Nontawat
    Liu, Xin
    Murata, Tsuyoshi
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1504 - 1517
  • [3] Heterogeneous Document Embeddings for Cross-Lingual Text Classification
    Moreo, Alejandro
    Pedrotti, Andrea
    Sebastiani, Fabrizio
    36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 685 - 688
  • [4] Cross-lingual Aspect-level Sentiment Classification with Graph Neural Network
    Bao X.-Y.
    Jiang X.-T.
    Wang Z.-Q.
    Zhou G.-D.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (02): : 676 - 689
  • [5] Cross-lingual Distillation for Text Classification
    Xu, Ruochen
    Yang, Yiming
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1415 - 1425
  • [6] Cross-lingual Knowledge Graph Alignment via Graph Matching Neural Network
    Xu, Kun
    Wang, Liwei
    Yu, Mo
    Feng, Yansong
    Song, Yan
    Wang, Zhiguo
    Yu, Dong
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3156 - 3161
  • [7] Heterogeneous Graph Neural Network for Short Text Classification
    Zhang, Bingjie
    He, Qing
    Zhang, Damin
    APPLIED SCIENCES-BASEL, 2022, 12 (17):
  • [8] Generalized Funnelling: Ensemble Learning and Heterogeneous Document Embeddings for Cross-Lingual Text Classification
    Moreo, Alejandro
    Pedrotti, Andrea
    Sebastiani, Fabrizio
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (02)
  • [9] Transductive Representation Learning for Cross-Lingual Text Classification
    Guo, Yuhong
    Xiao, Min
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 888 - 893
  • [10] Bi-Neighborhood Graph Neural Network for cross-lingual entity alignment
    Shi, Xinchen
    Li, Bin
    Chen, Ling
    Yang, Chao
    KNOWLEDGE-BASED SYSTEMS, 2023, 277