Word Representation Learning in Multimodal Pre-Trained Transformers: An Intrinsic Evaluation

Cited by: 4
Authors
Pezzelle, Sandro [1]
Takmaz, Ece [1]
Fernandez, Raquel [1]
Affiliations
[1] Univ Amsterdam, Inst Log Language & Computat, Amsterdam, Netherlands
Funding
European Research Council
Keywords
DISTRIBUTIONAL SEMANTICS; MODELS;
DOI
10.1162/tacl_a_00443
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline codes
081104; 0812; 0835; 1405
Abstract
This study carries out a systematic intrinsic evaluation of the semantic representations learned by state-of-the-art pre-trained multimodal Transformers. These representations are claimed to be task-agnostic and have been shown to help on many downstream language-and-vision tasks. However, the extent to which they align with human semantic intuitions remains unclear. We experiment with various models and obtain static word representations from the contextualized ones they learn. We then evaluate them against the semantic judgments provided by human speakers. In line with previous evidence, we observe a generalized advantage of multimodal representations over language-only ones on concrete word pairs, but not on abstract ones. On the one hand, this confirms the effectiveness of these models in aligning language and vision, which results in better semantic representations for concepts that are grounded in images. On the other hand, the models are shown to follow different representation learning patterns, which sheds some light on how and when they perform multimodal integration.
Pages: 1563–1579
Page count: 17
Related papers
50 items total
  • [41] Sparse Pairwise Re-ranking with Pre-trained Transformers
    Gienapp, Lukas
    Froebe, Maik
    Hagen, Matthias
    Potthast, Martin
    PROCEEDINGS OF THE 2022 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2022, 2022, : 250 - 258
  • [42] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
    Cao, Qingqing
    Trivedi, Harsh
    Balasubramanian, Aruna
    Balasubramanian, Niranjan
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 4487 - 4497
  • [43] Towards Summarizing Code Snippets Using Pre-Trained Transformers
    Mastropaolo, Antonio
    Tufano, Rosalia
    Ciniselli, Matteo
    Aghajani, Emad
    Pascarella, Luca
    Bavota, Gabriele
    arXiv preprint
  • [44] Routing Generative Pre-Trained Transformers for Printed Circuit Board
    Wang, Hao
    Tu, Jun
    Bai, Shenglong
    Zheng, Jie
    Qian, Weikang
    Chen, Jienan
    2024 INTERNATIONAL SYMPOSIUM OF ELECTRONICS DESIGN AUTOMATION, ISEDA 2024, 2024, : 160 - 165
  • [45] Investor's ESG tendency probed by pre-trained transformers
    Li, Chao
    Keeley, Alexander Ryota
    Takeda, Shutaro
    Seki, Daikichi
    Managi, Shunsuke
    CORPORATE SOCIAL RESPONSIBILITY AND ENVIRONMENTAL MANAGEMENT, 2025, 32 (02) : 2051 - 2071
  • [46] TWilBert: Pre-trained deep bidirectional transformers for Spanish Twitter
    Gonzalez, Jose Angel
    Hurtado, Lluis-F.
    Pla, Ferran
    NEUROCOMPUTING, 2021, 426 : 58 - 69
  • [47] An Empirical Study of Pre-trained Transformers for Arabic Information Extraction
    Lan, Wuwei
    Chen, Yang
    Xu, Wei
    Ritter, Alan
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4727 - 4734
  • [48] Causal Interpretation of Self-Attention in Pre-Trained Transformers
    Rohekar, Raanan Y.
    Gurwicz, Yaniv
    Nisimov, Shami
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [49] Handwritten Document Recognition Using Pre-trained Vision Transformers
    Parres, Daniel
    Anitei, Dan
    Paredes, Roberto
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT II, 2024, 14805 : 173 - 190
  • [50] Experiments in News Bias Detection with Pre-trained Neural Transformers
    Menzner, Tim
    Leidner, Jochen L.
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT IV, 2024, 14611 : 270 - 284