Towards Automatic Acquisition of a Fully Sense Tagged Corpus for Persian

被引:0
|
作者
Sarrafzadeh, Bahareh [1 ]
Yakovets, Nikolay [1 ]
Cercone, Nick [1 ]
An, Aijun [1 ]
机构
[1] York Univ, Dept Comp Sci & Engn, N York, ON M3J 1P3, Canada
来源
FOUNDATIONS OF INTELLIGENT SYSTEMS | 2011年 / 6804卷
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Sense tagged corpora play a crucial role in Natural Language Processing, particularly in Word Sense Disambiguation and Natural Language Understanding. Since semantic annotations are usually performed by humans, such corpora are limited to a handful of tagged texts and are not available for many languages with scarce resources including Persian. The shortage of efficient, reliable linguistic resources and fundamental text processing modules for Persian have been a challenge for researchers investigating this language. We employ a newly-proposed cross-lingual sense disambiguation algorithm to automatically create large sense tagged corpora. The initial evaluation of the tagged corpus indicates promising results.
引用
收藏
页码:449 / 455
页数:7
相关论文
共 50 条
  • [1] AutoASC - A system for automatic acquisition of sense tagged corpora
    Mihalcea, R
    Moldovan, DI
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2000, 14 (01) : 3 - 17
  • [2] Design and prototype of a large-scale and fully sense-tagged corpus
    Ker, Sue-Jin
    Huang, Chu-Ren
    Hong, Jia-Fei
    Liu, Shi-Yin
    Jian, Hui-Ling
    Su, I-Li
    Hsieh, Shu-Kai
    LARGE-SCALE KNOWLEDGE RESOURCES: CONSTRUCTION AND APPLICATION, 2008, 4938 : 186 - +
  • [3] An Experience in Developing the Nepali Sense Tagged Corpus
    Sarkar, Sunita
    Roy, Arindam
    Paul, Abhijit
    Purkayastha, Bipul Syam
    2015 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION & AUTOMATION (ICCCA), 2015, : 279 - 282
  • [4] Building The Sense-Tagged Multilingual Parallel Corpus
    Wang, Shan
    Bond, Francis
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2403 - 2409
  • [5] DutchSemCor: Targeting the ideal sense-tagged corpus
    Vossen, Piek
    Gorog, Attila
    Izquierdo, Ruben
    van den Bosch, Antal
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 584 - 589
  • [6] Japanese semcor: A sense-tagged corpus of Japanese
    Linguistics and Multilingual Studies, Nanyang Technological University, Singapore
    不详
    不详
    GWC Int. WordNet Conf. Proc., (56-63):
  • [7] An automatic method for generating sense tagged corpora
    Mihalcea, R
    Moldovan, DI
    SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), 1999, : 461 - 466
  • [8] Building Sense Tagged Corpus Using Wikipedia for Supervised Word Sense Disambiguation
    Saif, Abdulgabbar
    Omar, Nazlia
    Zainodin, Ummi Zakiah
    Ab Aziz, Mohd Juziaddin
    8TH ANNUAL INTERNATIONAL CONFERENCE ON BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, BICA 2017 (EIGHTH ANNUAL MEETING OF THE BICA SOCIETY), 2018, 123 : 403 - 412
  • [9] A Preliminary Study on Semi-automatic Construction of Sense Tagged Corpus with WordNet Senses Using Semantic Vector
    Tuyen Thi-Thanh Do
    2017 SEVENTH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST2017), 2017, : 490 - 496
  • [10] Creating a Corpus for Automatic Punctuation Prediction in Persian Texts
    Hosseini, Seyyed MohammadSaleh
    Sameti, Hossein
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1537 - 1542