Corpus-based Semantic Relatedness for the Construction of Polish WordNet

被引:0
|
作者
Broda, Bartosz [1 ]
Derwojedowa, Magdalena [2 ]
Piasecki, Maciej [1 ]
Szpakowicz, Stanislaw [3 ,4 ]
机构
[1] Wroclaw Univ Technol, Inst Appl Informat, Wroclaw, Poland
[2] Warsaw Univ, Inst Polish Language, Warsaw, Poland
[3] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON, Canada
[4] Polish Acad Sci, Inst Comp Sci, Warsaw, Poland
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
The construction of a wordnet, a labour-intensive enterprise, can be significantly assisted by automatic grouping of lexical material and discovery of lexical semantic relations. The objective is to ensure high quality of automatically acquired results before they are presented for lexicographers' approval. We discuss a software tool that suggests synset members using a measure of semantic relatedness with a given verb or adjective; this extends previous work on nominal synsets in Polish WordNet. Syntactically-motivated constraints are deployed on a large morphologically annotated corpus of Polish. Evaluation has been performed via the WordNet-Based Similarity Test and additionally supported by human raters. A lexicographer also manually assessed a suitable sample of suggestions. The results compare favourably with other known methods of acquiring semantic relations.
引用
收藏
页码:1800 / 1807
页数:8
相关论文
共 50 条
  • [1] A Comparison of Corpus-Based and Structural Methods on Approximation of Semantic Relatedness in Ontologies
    Ruotsalo, Tuukka
    Makela, Eetu
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2009, 5 (04) : 39 - 56
  • [2] Konkani WordNet: Corpus-Based Enhancement using Crowdsourcing
    Manerkar, Sanjana
    Asnani, Kavita
    Khorjuvenkar, Preeti Ravindranath
    Desai, Shilpa
    Pawar, Jyoti D.
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (04)
  • [3] ON THE POLYSEMY OF THE POLISH COMPLETE PATH CONSTRUCTION: A CORPUS-BASED EXPLORATORY STUDY
    Bebeniec, Daria
    Cudna, Malgorzata
    POZNAN STUDIES IN CONTEMPORARY LINGUISTICS, 2019, 55 (04): : 631 - 670
  • [4] WordNet Gloss for Semantic Concept Relatedness
    Bijaksana, Moch Arif
    Permadi, Rakhmad Indra
    RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING, 2017, 549 : 406 - 413
  • [5] A Corpus-based Semantic Study of Possibly
    Wu, Guoliang
    Feng, Chuncan
    PROCEEDINGS OF 2011 INTERNATIONAL SYMPOSIUM ON COGNITIVE LINGUISTICS AND ENGLISH LEARNING, 2012, : 190 - 197
  • [6] Concept vector for semantic similarity and relatedness based on WordNet structure
    Liu, Hongzhe
    Bao, Hong
    Xu, De
    JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (02) : 370 - 381
  • [7] Evaluating WordNet-based measures of lexical semantic relatedness
    Budanitsky, Alexander
    Hirst, Graeme
    COMPUTATIONAL LINGUISTICS, 2006, 32 (01) : 13 - 47
  • [8] A Corpus-Based Study of Semantic Categorizations of Attracted Adjectives to the it BE ADJ clause Construction
    Wang, Jiaojiao
    Zhou, Jiangping
    SAGE OPEN, 2022, 12 (02):
  • [9] Complementing WordNet with Roget's and corpus-based thesauri for information retrieval
    Mandala, R
    Tokunaga, T
    Tanaka, H
    NINTH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS, 1999, : 94 - 101
  • [10] African Wordnet as a tool to identify semantic relatedness and semantic similarity
    Madonsela, Stanley
    SOUTH AFRICAN JOURNAL OF AFRICAN LANGUAGES, 2019, 39 (02) : 185 - 190