MTLAN: Multi-Task Learning and Auxiliary Network for Enhanced Sentence Embedding

被引:0
|
作者
Liu, Gang [1 ,2 ]
Wang, Tongli [1 ]
Yang, Wenli [1 ]
Yan, Zhizheng [1 ]
Zhan, Kai [3 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin, Peoples R China
[2] Harbin Engn Univ, Modeling & Emulat E Govt Natl Engn Lab, Harbin, Peoples R China
[3] PwC Enterprise Digital, PricewaterhouseCoopers, Sydney, NSW, Australia
关键词
Cross-lingual; Sentence embedding; Multi-task learning; Contrastive learning; Auxiliary network;
D O I
10.1007/978-981-99-8067-3_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The objective of cross-lingual sentence embedding learning is to map sentences into a shared representation space, where semantically similar sentence representations are closer together, while distinct sentence representations exhibit clear differentiation. This paper proposes a novel sentence embedding model called MTLAN, which incorporates multi-task learning and auxiliary networks. The model utilizes the LaBSE model for extracting sentence features and undergoes joint training on tasks related to sentence semantic representation and distance measurement. Furthermore, an auxiliary network is employed to enhance the contextual expression of words within sentences. To address the issue of limited resources for low-resource languages, we construct a pseudocorpus dataset using a multilingual dictionary for unsupervised learning. We conduct experiments on multiple publicly available datasets, including STS and SICK, to evaluate both monolingual sentence similarity and cross-lingual semantic similarity. The empirical results demonstrate the significant superiority of our proposed model over state-of-the-art methods.
引用
收藏
页码:16 / 27
页数:12
相关论文
共 50 条
  • [41] Lexicon-Enhanced Multi-Task Convolutional Neural Network for Emotion Distribution Learning
    Dong, Yuchang
    Zeng, Xueqiang
    AXIOMS, 2022, 11 (04)
  • [42] Expressive user embedding from churn and recommendation multi-task learning
    Bai, Huajun
    Liu, Davide
    Hirtz, Thomas
    Boulenger, Alexandre
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 37 - 40
  • [43] Enhanced task attention with adversarial learning for dynamic multi-task CNN
    Fang, Yuchun
    Xiao, Shiwei
    Zhou, Menglu
    Cai, Sirui
    Zhang, Zhaoxiang
    PATTERN RECOGNITION, 2022, 128
  • [44] Multiple Relational Attention Network for Multi-task Learning
    Zhao, Jiejie
    Du, Bowen
    Sun, Leilei
    Zhuang, Fuzhen
    Lv, Weifeng
    Xiong, Hui
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 1123 - 1131
  • [45] Multi-task learning network for handwritten numeral recognition
    Hou, Jinhui
    Zeng, Huanqiang
    Cai, Lei
    Zhu, Jianqing
    Chen, Jing
    Cai, Canhui
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (02) : 843 - 850
  • [46] MCapsNet: Capsule Network for Text with Multi-Task Learning
    Xiao, Liqiang
    Zhang, Honglun
    Chen, Wenqing
    Wang, Yongkun
    Jin, Yaohui
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4565 - 4574
  • [47] Multi-Task Adversarial Network for Disentangled Feature Learning
    Liu, Yang
    Wang, Zhaowen
    Jin, Hailin
    Wassell, Ian
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3743 - 3751
  • [48] Dynamic Multi-Task Learning with Convolutional Neural Network
    Fang, Yuchun
    Ma, Zhengyan
    Zhang, Zhaoxiang
    Zhang, Xu-Yao
    Bai, Xiang
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1668 - 1674
  • [49] Multi-task Transfer Learning for Bayesian Network Structures
    Benikhlef, Sarah
    Leray, Philippe
    Raschia, Guillaume
    Ben Messaoud, Montassar
    Sakly, Fayrouz
    SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2021, 2021, 12897 : 217 - 228
  • [50] A multi-task learning network for skin disease classification
    Wang, W.
    Wang, Y.
    Zhao, S.
    Chen, X.
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2022, 142 (08) : S52 - S52