ML Mob at SemEval-2023 Task 1: Probing CLIP on VisualWord-Sense Disambiguation

被引:0
|
作者
Poth, Clifton A. [1 ]
Hentschel, Martin B. [1 ]
Werner, Tobias [1 ]
Sterz, Hannah [1 ]
Bongard, Leonard [1 ]
机构
[1] Tech Univ Darmstadt, Darmstadt, Germany
来源
17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023 | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Successful word sense disambiguation (WSD) is a fundamental element of natural language understanding. As part of SemEval-2023 Task 1, we investigate WSD in a multimodal setting, where ambiguous words are to be matched with candidate images representing word senses. We compare multiple systems based on pre-trained CLIP models. In our experiments, we find CLIP to have solid zero-shot performance on monolingual and multilingual data. By employing different fine-tuning techniques, we are able to further enhance performance. However, transferring knowledge between data distributions proves to be more challenging.
引用
收藏
页码:1463 / 1469
页数:7
相关论文
共 25 条
  • [21] OPI PIB at SemEval-2023 Task 1: A CLIP-based Solution Paired with an Additional Word Context Extension
    Grebowiec, Malgorzata
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 482 - 487
  • [22] PMCoders at SemEval-2023 Task 1: RAltCLIP: Use Relative AltCLIP Features to Rank
    Pirhadi, Mohammad Javad
    Mirzaei, Motahhare
    Mohammadi, Mohammad Reza
    Eetemadi, Sauleh
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1751 - 1755
  • [23] RCLN at SemEval-2023 Task 1: Leveraging Stable Diffusion and Image Captions for Visual WSD
    Mijatovic, Antonina
    Borisova, Ekaterina
    Buscaldi, Davide
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2174 - 2178
  • [24] LCT-1 at SemEval-2023 Task 10: Pre-training and Multi-task Learning for Sexism Detection and Classification
    Chernyshev, Konstantin
    Garanina, Ekaterina
    Bayram, Duygu
    Zheng, Qiankun
    Edman, Lukas
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1573 - 1581
  • [25] Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion
    Li, Jie S.
    Shiue, Yow-Ting
    Shih, Yong-Siang
    Geiping, Jonas
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 44 - 49