HAM-GNN: A hierarchical attention-based multi-dimensional edge graph neural network for dialogue act classification

被引:1
|
作者
Fu, Changzeng [1 ,2 ,5 ]
Su, Yikai [1 ]
Su, Kaifeng [1 ]
Liu, Yinghao [1 ]
Shi, Jiaqi [2 ,3 ]
Wu, Bowen [2 ,3 ]
Liu, Chaoran [3 ,4 ]
Ishi, Carlos Toshinori [3 ]
Ishiguro, Hiroshi [2 ]
机构
[1] Northeastern Univ, 143 Taishan Rd, Qinhuangdao, Hebei, Peoples R China
[2] Osaka Univ, 1-3 Mahcikaneyama, Toyonaka, Osaka, Japan
[3] RIKEN, 2-2-2 Hikaridai Seika, Kyoto, Japan
[4] Natl Inst Informat, 2-1-2 Hitotsubashi,Chiyoda Ku, Tokyo, Japan
[5] Northeastern Univ Qinhuangdao, Hebei Key Lab Marine Percept Network & Data Proc, Qinhuangdao, Peoples R China
关键词
Dialogue act; Graph neural networks; Attention mechanism;
D O I
10.1016/j.eswa.2024.125459
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dialogue act (DA) analysis is crucial for developing natural conversational systems and dialogue generation. Modelling DA labels at the utterance-level requires contextual and speaker-aware understanding, especially for conversational agents handling Japanese dialogues. In this study, we propose a Hierarchical Attention- based Multi-dimensional Edge Graph Neural Network (HAM-GNN) to effectively model DA labels by capturing speaker interconnections and contextual semantics. Specifically, long short-term memory networks (LSTMs) first encode contextual information within a dialogue window. We then construct a context graph by aggregating neighbouring utterances and apply a graph attention network (GAT) to model speaker interactions with multi-dimensional edges. To prevent incorrect edge definitions from completely deactivating connections during training, we initialize soft edges for nominally unconnected nodes with a small non-zero value. Moreover, to avoid loss of contextual information from localized graph construction, we utilize a convolutional Transformer (Conformer) to build residual connections. Subsequently, a gated graph convolutional network (GatedGCN) selects salient utterances for DA classification. Finally, multi-level representations are merged and fed to dense layers for classification. We evaluate our HAM-GNN model on the Japanese DA dataset (JPS-DA) and the English Switchboard DA dataset (SWDA). Results show our method outperforms baselines on JPS-DA and achieves competitive performance on SWDA. The graph-based architecture effectively encodes utterance semantics and speaker relationships for DA prediction in conversational systems.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] An integrated fuzzy neural supervision and attention-based graph neural network for improving network clustering
    Vo, Tham
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (33): : 24015 - 24035
  • [32] Crysformer: An attention-based graph neural network for properties prediction of crystals
    王田
    陈家辉
    滕婧
    史金钢
    曾新华
    Hichem Snoussi
    Chinese Physics B, 2023, (09) : 26 - 31
  • [33] Crysformer: An attention-based graph neural network for properties prediction of crystals
    Wang, Tian
    Chen, Jiahui
    Teng, Jing
    Shi, Jingang
    Zeng, Xinhua
    Snoussi, Hichem
    CHINESE PHYSICS B, 2023, 32 (09)
  • [34] Neural network systems for multi-dimensional temporal pattern classification
    Srinivasan, R
    Wang, C
    Ho, WK
    Lim, KW
    COMPUTERS & CHEMICAL ENGINEERING, 2005, 29 (05) : 965 - 981
  • [35] AST-GNN: An attention-based spatio-temporal graph neural network for Interaction-aware pedestrian trajectory prediction
    Zhou, Hao
    Ren, Dongchun
    Xia, Huaxia
    Fan, Mingyu
    Yang, Xu
    Huang, Hai
    NEUROCOMPUTING, 2021, 445 : 298 - 308
  • [36] Multi-scale attention-based convolutional neural network for classification of breast masses in mammograms
    Niu, Jing
    Li, Hua
    Zhang, Chen
    Li, Dengao
    MEDICAL PHYSICS, 2021, 48 (07) : 3878 - 3892
  • [37] MDAEN: Multi-Dimensional Attention-based Ensemble Network in Deep Reinforcement Learning Framework for Portfolio Management
    Zhang, Ruiyu
    Ren, Xiaotian
    Gu, Fengchen
    Stefanidis, Angelos
    Sun, Ruoyu
    Su, Jionglong
    2022 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, CYBERC, 2022, : 143 - 151
  • [38] MERG: MULTI-DIMENSIONAL EDGE REPRESENTATION GENERATION LAYER FOR GRAPH NEURAL NETWORKS
    Song, Yuxin
    Luo, Cheng
    Jackson, Aaron
    Jia, Xi
    Xie, Weicheng
    Shen, Linlin
    Gunes, Hatice
    Song, Siyang
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 5475 - 5479
  • [39] Classification of Camellia oleifera Diseases in Complex Environments by Attention and Multi-Dimensional Feature Fusion Neural Network
    Chen, Yixin
    Wang, Xiyun
    Chen, Zhibo
    Wang, Kang
    Sun, Ye
    Jiang, Jiarong
    Liu, Xuhao
    PLANTS-BASEL, 2023, 12 (14):
  • [40] DOCUMENT CLASSIFICATION BASED ON CONVOLUTIONAL NEURAL NETWORK AND HIERARCHICAL ATTENTION NETWORK
    Cheng, Y.
    Ye, Z.
    Wang, M.
    Zhang, Q.
    NEURAL NETWORK WORLD, 2019, 29 (02) : 83 - 98