Corpus-based Semantic Relatedness for the Construction of Polish WordNet

被引:0
|
作者
Broda, Bartosz [1 ]
Derwojedowa, Magdalena [2 ]
Piasecki, Maciej [1 ]
Szpakowicz, Stanislaw [3 ,4 ]
机构
[1] Wroclaw Univ Technol, Inst Appl Informat, Wroclaw, Poland
[2] Warsaw Univ, Inst Polish Language, Warsaw, Poland
[3] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON, Canada
[4] Polish Acad Sci, Inst Comp Sci, Warsaw, Poland
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
The construction of a wordnet, a labour-intensive enterprise, can be significantly assisted by automatic grouping of lexical material and discovery of lexical semantic relations. The objective is to ensure high quality of automatically acquired results before they are presented for lexicographers' approval. We discuss a software tool that suggests synset members using a measure of semantic relatedness with a given verb or adjective; this extends previous work on nominal synsets in Polish WordNet. Syntactically-motivated constraints are deployed on a large morphologically annotated corpus of Polish. Evaluation has been performed via the WordNet-Based Similarity Test and additionally supported by human raters. A lexicographer also manually assessed a suitable sample of suggestions. The results compare favourably with other known methods of acquiring semantic relations.
引用
收藏
页码:1800 / 1807
页数:8
相关论文
共 50 条
  • [41] Corpus-based approaches to semantic interpretation in natural language processing
    Ng, HT
    Zelle, J
    AI MAGAZINE, 1997, 18 (04) : 45 - 64
  • [42] A corpus-based study of the Spanish comparative correlative construction
    Horsch, Jakob
    REVIEW OF COGNITIVE LINGUISTICS, 2023,
  • [43] Integration of semantic networks for corpus-based word sense disambiguation
    Moon, YJ
    Min, KH
    Hwang, YH
    Kim, P
    LOGIC PROGRAMMING, PROCEEDINGS, 2003, 2916 : 492 - 493
  • [44] Translations as semantic mirrors: from parallel corpus to wordnet
    Dyvik, H
    ADVANCES IN CORPUS LINGUISTICS, 2004, (49): : 311 - 326
  • [45] Words, Concepts and Relations in the Construction of Polish WordNet
    Derwojedowa, Magdalena
    Piasecki, Maciej
    Szpakowicz, Stanislaw
    Zawislawska, Magdalena
    Broda, Bartosz
    GWC 2008: FOURTH GLOBAL WORDNET CONFERENCE, PROCEEDINGS, 2007, : 162 - 177
  • [46] Semantic search extension based on Polish WordNet relations in business document exploration
    Potiopa, Piotr
    Karwatowski, Michal
    Duda, Jerzy
    Sasor, Pawel
    Wielgosz, Maciej
    Muzykiewicz, Bartlomiej
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND MACHINE LEARNING (IML'17), 2017,
  • [47] The Division into Parts of Speech in the Corpus-based Dictionary of Polish Sign Language
    Linde-Usiekniewicz, Jadwiga
    Rutkowski, Pawel
    PROCEEDINGS OF THE XVII EURALEX INTERNATIONAL CONGRESS: LEXICOGRAPHY AND LINGUISTIC DIVERSITY, 2016, : 375 - 388
  • [48] CORPUS-BASED SYNTACTIC-SEMANTIC GRAPH ANALYSIS: SEMANTIC DOMAINS OF THE CONCEPT FEELING
    Perak, Benedikt
    Kirigin, Tajana Ban
    RASPRAVE, 2020, 46 (02): : 957 - 996
  • [49] Standardisation tendencies in an expanded Europe - A corpus-based study of the anglicisms in Polish
    Wingendcr, Monika
    WELT DER SLAVEN-HALBJAHRESSCHRIFT FUR SLAVISTIK, 2007, 52 (01): : 1 - 20
  • [50] Sense-Based Clustering of Polish Nouns in the Extraction of Semantic Relatedness
    Broda, Bartosz
    Piasecki, Maciej
    Szpakowicz, Stanislaw
    2008 INTERNATIONAL MULTICONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (IMCSIT), VOLS 1 AND 2, 2008, : 72 - +