Multilingual Universal Sentence Encoder for Semantic Retrieval

Cited by: 0
Authors
Yang, Yinfei [1]
Cer, Daniel [1]
Ahmad, Amin [1]
Guo, Mandy [1]
Law, Jax [1]
Constant, Noah [1]
Abrego, Gustavo Hernandez [1]
Yuan, Steve [2]
Tar, Chris [1]
Sung, Yun-Hsuan [1]
Strope, Brian [1]
Kurzweil, Ray [1]
Affiliations
[1] Google AI, Mountain View, CA 94043 USA
[2] Google, Cambridge, MA USA
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We present easy-to-use, retrieval-focused multilingual sentence embedding models, made available on TensorFlow Hub. The models embed text from 16 languages into a shared semantic space using a multi-task trained dual-encoder that learns tied cross-lingual representations via translation bridge tasks (Chidambaram et al., 2018). The models achieve a new state of the art on monolingual and cross-lingual semantic retrieval (SR). Competitive performance is obtained on the related tasks of translation pair bitext retrieval (BR) and retrieval question answering (ReQA). On transfer learning tasks, our multilingual embeddings approach, and in some cases exceed, the performance of English-only sentence embeddings.
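As a rough illustration of how embeddings in a shared semantic space can be used for retrieval, the sketch below ranks candidate sentences against a query by cosine similarity. It does not load the published models: the vectors are hand-made toy values standing in for encoder output, and the names `l2_normalize` and `rank_candidates` are hypothetical helpers introduced only for this example.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return list(vec)
    return [x / norm for x in vec]


def rank_candidates(query_vec, candidate_vecs):
    """Return (candidate_index, cosine_similarity) pairs, best match first."""
    q = l2_normalize(query_vec)
    scored = []
    for idx, cand in enumerate(candidate_vecs):
        c = l2_normalize(cand)
        # Dot product of unit vectors equals cosine similarity.
        score = sum(a * b for a, b in zip(q, c))
        scored.append((idx, score))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy data (assumption): in practice these vectors would come from a
# multilingual sentence encoder, not be written by hand.
candidates = [
    "How old are you?",
    "¿Cuántos años tienes?",
    "The weather is nice today.",
]
candidate_vecs = [
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.1],
    [0.0, 0.1, 0.9],
]
query_vec = [1.0, 0.0, 0.0]  # stands in for the embedding of "What is your age?"

ranking = rank_candidates(query_vec, candidate_vecs)
for idx, score in ranking:
    print(f"{score:.3f}  {candidates[idx]}")
```

Because queries and candidates share one vector space, the same ranking step applies whether the candidate is in the query's language or a different one; only the encoder that produces the vectors changes.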
Pages: 87-94
Page count: 8