MTLAN: Multi-Task Learning and Auxiliary Network for Enhanced Sentence Embedding

被引:0
|
作者
Liu, Gang [1 ,2 ]
Wang, Tongli [1 ]
Yang, Wenli [1 ]
Yan, Zhizheng [1 ]
Zhan, Kai [3 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin, Peoples R China
[2] Harbin Engn Univ, Modeling & Emulat E Govt Natl Engn Lab, Harbin, Peoples R China
[3] PwC Enterprise Digital, PricewaterhouseCoopers, Sydney, NSW, Australia
关键词
Cross-lingual; Sentence embedding; Multi-task learning; Contrastive learning; Auxiliary network;
D O I
10.1007/978-981-99-8067-3_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The objective of cross-lingual sentence embedding learning is to map sentences into a shared representation space, where semantically similar sentence representations are closer together, while distinct sentence representations exhibit clear differentiation. This paper proposes a novel sentence embedding model called MTLAN, which incorporates multi-task learning and auxiliary networks. The model utilizes the LaBSE model for extracting sentence features and undergoes joint training on tasks related to sentence semantic representation and distance measurement. Furthermore, an auxiliary network is employed to enhance the contextual expression of words within sentences. To address the issue of limited resources for low-resource languages, we construct a pseudocorpus dataset using a multilingual dictionary for unsupervised learning. We conduct experiments on multiple publicly available datasets, including STS and SICK, to evaluate both monolingual sentence similarity and cross-lingual semantic similarity. The empirical results demonstrate the significant superiority of our proposed model over state-of-the-art methods.
引用
收藏
页码:16 / 27
页数:12
相关论文
共 50 条
  • [21] Multi-Task Metric Learning on Network Data
    Fang, Chen
    Rockmore, Daniel N.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART I, 2015, 9077 : 317 - 329
  • [22] Distributed Multi-task Learning for Sensor Network
    Li, Jiyi
    Arai, Tomohiro
    Baba, Yukino
    Kashima, Hisashi
    Miwa, Shotaro
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT II, 2017, 10535 : 657 - 672
  • [23] Keeping Consistency of Sentence Generation and Document Classification with Multi-Task Learning
    Nishino, Toru
    Misawa, Shotaro
    Kano, Ryuji
    Taniguchi, Tomoki
    Miura, Yasuhide
    Ohkuma, Tomoko
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3195 - 3205
  • [24] Sentence-graph-level knowledge injection with multi-task learning
    Chen, Liyi
    Wang, Runze
    Shi, Chen
    Yuan, Yifei
    Liu, Jie
    Hu, Yuxiang
    Jiang, Feijun
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2025, 28 (01):
  • [25] Multi-task gradient descent for multi-task learning
    Lu Bai
    Yew-Soon Ong
    Tiantian He
    Abhishek Gupta
    Memetic Computing, 2020, 12 : 355 - 369
  • [26] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
  • [27] Multi-task Projected Embedding for Igbo
    Ezeani, Ignatius
    Hepple, Mark
    Onyenwe, Ikechukwu
    Enemuo, Chioma
    TEXT, SPEECH, AND DIALOGUE (TSD 2018), 2018, 11107 : 285 - 294
  • [28] Multi-Task Learning for Email Search Ranking with Auxiliary Query Clustering
    Shen, Jiaming
    Karimzadehgan, Maryam
    Bendersky, Michael
    Qin, Zhen
    Metzler, Donald
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 2127 - 2135
  • [29] Pupil center detection inspired by multi-task auxiliary learning characteristic
    Xiang, Zheng
    Zhao, Xinbo
    Fang, Aiqing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (28) : 40067 - 40088
  • [30] Pupil center detection inspired by multi-task auxiliary learning characteristic
    Zheng Xiang
    Xinbo Zhao
    Aiqing Fang
    Multimedia Tools and Applications, 2022, 81 : 40067 - 40088