Word Representation Learning in Multimodal Pre-Trained Transformers: An Intrinsic Evaluation

Times Cited: 4
Authors
Pezzelle, Sandro [1 ]
Takmaz, Ece [1 ]
Fernandez, Raquel [1 ]
Affiliations
[1] Univ Amsterdam, Inst Log Language & Computat, Amsterdam, Netherlands
Funding
European Research Council;
Keywords
DISTRIBUTIONAL SEMANTICS; MODELS;
DOI
10.1162/tacl_a_00443
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Discipline Classification Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This study carries out a systematic intrinsic evaluation of the semantic representations learned by state-of-the-art pre-trained multimodal Transformers. These representations are claimed to be task-agnostic and have been shown to help on many downstream language-and-vision tasks. However, the extent to which they align with human semantic intuitions remains unclear. We experiment with various models and obtain static word representations from the contextualized ones they learn. We then evaluate them against the semantic judgments provided by human speakers. In line with previous evidence, we observe a generalized advantage of multimodal representations over language-only ones on concrete word pairs, but not on abstract ones. On the one hand, this confirms the effectiveness of these models in aligning language and vision, which results in better semantic representations for concepts that are grounded in images. On the other hand, the models are shown to follow different representation learning patterns, which sheds some light on how and when they perform multimodal integration.
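The abstract describes the evaluation pipeline only at a high level: derive a static vector per word from a model's contextualized representations, then compare word-pair similarities against human semantic judgments. Below is a minimal sketch of that idea, assuming a text-only BERT encoder as a stand-in for the multimodal models studied in the paper, with toy contexts and made-up human ratings (the benchmark pairs would normally come from a dataset such as SimLex-999); the layer and pooling choices are illustrative assumptions, not the paper's exact setup.

```python
# Sketch: (1) build a static embedding per word by averaging its
# contextualized vectors over several occurrences, (2) correlate pairwise
# cosine similarities with human similarity ratings via Spearman's rho.
import torch
from scipy.stats import spearmanr
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")
model.eval()

def static_embedding(word, contexts):
    """Average the contextualized vectors of `word` across its contexts."""
    word_ids = tokenizer(word, add_special_tokens=False)["input_ids"]
    vecs = []
    for ctx in contexts:
        enc = tokenizer(ctx, return_tensors="pt")
        with torch.no_grad():
            hidden = model(**enc).last_hidden_state[0]  # (seq_len, dim)
        ids = enc["input_ids"][0].tolist()
        # locate the subword span of `word` and mean-pool its hidden states
        for i in range(len(ids) - len(word_ids) + 1):
            if ids[i : i + len(word_ids)] == word_ids:
                vecs.append(hidden[i : i + len(word_ids)].mean(dim=0))
                break
    return torch.stack(vecs).mean(dim=0)

# Hypothetical word pairs with (made-up) human similarity ratings.
pairs = [("dog", "cat", 7.3), ("car", "bicycle", 5.9), ("idea", "stone", 1.1)]
contexts = {
    "dog": ["The dog barked loudly.", "She walked her dog in the park."],
    "cat": ["The cat slept on the sofa.", "A black cat crossed the street."],
    "car": ["He parked the car outside.", "The car would not start."],
    "bicycle": ["She rode her bicycle to work.", "The bicycle had a flat tire."],
    "idea": ["That is a brilliant idea.", "The idea never caught on."],
    "stone": ["He threw a stone into the lake.", "The wall was built of stone."],
}

model_sims, human_sims = [], []
for w1, w2, rating in pairs:
    v1 = static_embedding(w1, contexts[w1])
    v2 = static_embedding(w2, contexts[w2])
    model_sims.append(torch.cosine_similarity(v1, v2, dim=0).item())
    human_sims.append(rating)

# Spearman correlation between model similarities and human judgments.
print("Spearman rho:", spearmanr(model_sims, human_sims).correlation)
```

In the paper's setting, the same procedure would be applied to the multimodal encoders under study, so that their static word vectors can be compared directly with those of language-only models on concrete versus abstract word pairs.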
Pages: 1563-1579
Number of Pages: 17