ML Mob at SemEval-2023 Task 1: Probing CLIP on VisualWord-Sense Disambiguation

被引:0
|
作者
Poth, Clifton A. [1 ]
Hentschel, Martin B. [1 ]
Werner, Tobias [1 ]
Sterz, Hannah [1 ]
Bongard, Leonard [1 ]
机构
[1] Tech Univ Darmstadt, Darmstadt, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Successful word sense disambiguation (WSD) is a fundamental element of natural language understanding. As part of SemEval-2023 Task 1, we investigate WSD in a multimodal setting, where ambiguous words are to be matched with candidate images representing word senses. We compare multiple systems based on pre-trained CLIP models. In our experiments, we find CLIP to have solid zero-shot performance on monolingual and multilingual data. By employing different fine-tuning techniques, we are able to further enhance performance. However, transferring knowledge between data distributions proves to be more challenging.
引用
收藏
页码:1463 / 1469
页数:7
相关论文
共 25 条
  • [1] teamPN at SemEval-2023 Task 1: VisualWord Sense Disambiguation Using Zero-Shot MultiModal Approach
    Katyal, Nikita
    Rajpoot, Pawan
    Tamilarasu, Subhanandh
    Mustafi, Joy
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 457 - 461
  • [2] SemEval-2023 Task 1: Visual Word Sense Disambiguation
    Raganato, Alessandro
    Calixto, Iacer
    Ushio, Asahi
    Camacho-Collados, Jose
    Pilehvar, Mohammad Taher
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2227 - 2234
  • [3] SUT at SemEval-2023 Task 1: Prompt Generation for Visual Word Sense Disambiguation
    Ghahroodi, Omid
    Dalili, Seyed Arshan
    Mesforoush, Sahel
    Asgari, Ehsaneddin
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2160 - 2163
  • [4] PoliTo at SemEval-2023 Task 1: CLIP-based Visual-Word Sense Disambiguation Based on Back-Translation
    Vaiani, Lorenzo
    Cagliero, Luca
    Garza, Paolo
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1447 - 1453
  • [5] UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation
    Ogezi, Michael
    Hauer, Bradley
    Omarov, Talgat
    Shi, Ning
    Kondrak, Grzegorz
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2043 - 2051
  • [6] Ebhaam at SemEval-2023 Task 1: A CLIP-Based Approach for Comparing Cross-modality and Unimodality in Visual Word Sense Disambiguation
    Taghavi, Zeinab
    Naeini, Parsa Haghighi
    Sadraei, Mohammad Ali
    Gooran, Soroush
    Asgari, Ehsaneddin
    Rabiee, Hamid Reza
    Sameti, Hossein
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1960 - 1964
  • [7] Rahul Patil at SemEval-2023 Task 1: V-WSD: Visual Word Sense Disambiguation
    Patil, Rahul
    Patel, Pinal
    Patel, Charin
    Verma, Mangal
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1271 - 1275
  • [8] GPL at SemEval-2023 Task 1: WordNet and CLIP to Disambiguate Images
    Zhang, Shibingfeng
    Nath, Shantanu
    Mazzaccara, Davide
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1592 - 1597
  • [9] TAM of SCNU at SemEval-2023 Task 1: FCLL: A Fine-grained Contrastive Language-Image Learning Model for Cross-language VisualWord Sense Disambiguation
    Yang, Qihao
    Li, Yong
    Wang, Xuelin
    Li, Shunhao
    Hao, Tianyong
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 506 - 511
  • [10] SzegedAI at SemEval-2023 Task 1: Applying Quasi-Symbolic Representations in Visual Word Sense Disambiguation
    Berend, Gabor
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1965 - 1971