MOTIF: Contextualized Images for Complex Words to Improve Human Reading

被引:0
|
作者
Wang, Xintong [1 ]
Schneider, Florian [1 ]
Alacam, Ozge [2 ,3 ]
Chaudhury, Prateek
Biemann, Chris [1 ]
机构
[1] Univ Hamburg, Hamburg, Germany
[2] Univ Bielefeld, Bielefeld, Germany
[3] Indian Inst Technol Delhi, Delhi, India
关键词
Context-dependent image retrieval; L2 reading material; Complex word identification;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
MOTIF (MultimOdal ConTextualized Images For Language Learners) is a multimodal dataset that consists of 1125 comprehension texts retrieved from Wikipedia Simple Corpus. Allowing multimodal processing or enriching the context with multimodal information has proven imperative for many learning tasks, specifically for second language (L2) learning. In this respect, several traditional NLP approaches can assist L2 readers in text comprehension processes, such as simplifying text or giving dictionary descriptions for complex words. As nicely stated in the well-known proverb, sometimes "a picture is worth a thousand words" and an image can successfully complement the verbal message by enriching the representation, like in Pictionary books. This multimodal support can also assist on-the-fly text reading experience by providing a multimodal tool that chooses and displays the most relevant images for the complex words, given the text context. This study mainly focuses on one of the key components to achieving this goal; collecting a multimodal dataset enriched with complex word annotation and validated image match.
引用
收藏
页码:2468 / 2477
页数:10
相关论文
共 50 条