Large-scale Cross-lingual Language Resources for Referencing and Framing

被引:0
|
作者
Vossen, Piek [1 ]
Ilievski, Filip [1 ]
Postma, Marten [1 ]
Fokkens, Antske [1 ]
Minnema, Gosse [2 ]
Remijnse, Levi [1 ]
机构
[1] Vrije Univ Amsterdam, De Boelelaan 1105, NL-1081 HV Amsterdam, Netherlands
[2] Univ Groningen, Oude Kijk Int Jatstr 26, NL-9712 EK Groningen, Netherlands
关键词
framing; FrameNet; reference. situation semantics; events; cross-lingual text corpora; BASIC LEVEL;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this article, we lay out the basic ideas and principles of the project Framing Situations in the Dutch Language. We provide our first results of data acquisition, together with the first data release. We introduce the notion of cross-lingual referential corpora. These corpora consist of texts that make reference to exactly the same incidents. The referential grounding allows us to analyze the framing of these incidents in different languages and across different texts. During the project, we will use the automatically generated data to study linguistic framing as a phenomenon, build framing resources such as lexicons and corpora. We expect to capture larger variation in framing compared to traditional approaches for building such resources. Our first data release, which contains structured data about a large number of incidents and reference texts, can be found at http://dutchframenet.nl/data- releases/.
引用
收藏
页码:3162 / 3171
页数:10
相关论文
共 50 条
  • [31] Language Model Priming for Cross-Lingual Event Extraction
    Fincke, Steven
    Agarwal, Shantanu
    Miller, Scott
    Boschee, Elizabeth
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10627 - 10635
  • [32] Multilingual Offensive Language Identification with Cross-lingual Embeddings
    Ranasinghe, Tharindu
    Zampieri, Marcos
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 5838 - 5844
  • [33] CROSS-LINGUAL TRANSFER LEARNING FOR SPOKEN LANGUAGE UNDERSTANDING
    Quynh Ngoc Thi Do
    Gaspers, Judith
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5956 - 5960
  • [34] Cross-Lingual Word Sense Disambiguation for Languages with Scarce Resources
    Sarrafzadeh, Bahareh
    Yakovets, Nikolay
    Cercone, Nick
    An, Aijun
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 6657 : 347 - 358
  • [35] Neural Cross-Lingual Named Entity Recognition with Minimal Resources
    Xie, Jiateng
    Yang, Zhilin
    Neubig, Graham
    Smith, Noah A.
    Carbonell, Jaime
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 369 - 379
  • [36] Neural Cross-Lingual Event Detection with Minimal Parallel Resources
    Liu, Jian
    Chen, Yubo
    Liu, Kang
    Zhao, Jun
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 738 - 748
  • [37] Improving Large-scale Language Models and Resources for Filipino
    Cruz, Jan Christian Blaise
    Cheng, Charibeth
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6548 - 6555
  • [38] Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models
    He, Zhiwei
    Zhou, Binglin
    Hao, Hongkun
    Liu, Aiwei
    Wang, Xing
    Tu, Zhaopeng
    Zhang, Zhuosheng
    Wang, Rui
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 4115 - 4129
  • [39] Design and Development of a Large Cross-Lingual Plagiarism Corpus for Urdu-English Language Pair
    Haneef, Israr
    Nawab, Rao Muhammad Adeel
    Munir, Ehsan Ullah
    Bajwa, Imran Sarwar
    SCIENTIFIC PROGRAMMING, 2019, 2019
  • [40] Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
    Qi, Jirui
    Fernandez, Raquel
    Bisazza, Arianna
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 10650 - 10666