Zuo Zhuan Ancient Chinese Dataset for Word Sense Disambiguation

Cited by: 0
Authors
Pan, Xiaomeng [1]
Wang, Hongfei [1]
Oka, Teruaki [1]
Komachi, Mamoru [1]
Affiliations
[1] Tokyo Metropolitan Univ, Tokyo, Japan
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Word Sense Disambiguation (WSD) is a core task in Natural Language Processing (NLP). Ancient Chinese has rarely been used in WSD tasks, however, because no public dataset for ancient Chinese WSD exists. Creating such a dataset is considered a significant challenge: determining the most appropriate sense in context is difficult and time-consuming because usage differs between ancient and modern Chinese. To address this gap, we annotate part of the pre-Qin (before 221 BC) text Zuo Zhuan using a copyright-free dictionary to create a public sense-tagged dataset. We then apply a simple k-Nearest Neighbors (k-NN) method using a pre-trained language model to the dataset. Our code and dataset will be available on GitHub.
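The k-NN step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `knn_sense`, the choice of cosine similarity with majority voting, and the two-dimensional toy vectors are all assumptions made for illustration. In practice the vectors would be contextual embeddings of the target word taken from a pre-trained language model, and the labels would be dictionary senses from the annotated dataset.

```python
from collections import Counter

import numpy as np


def knn_sense(query_vec, train_vecs, train_senses, k=3):
    """Return the majority sense among the k training vectors
    most similar (by cosine similarity) to query_vec.

    Hypothetical helper: the record does not specify the similarity
    measure or voting rule used by the authors.
    """
    train = np.asarray(train_vecs, dtype=float)
    query = np.asarray(query_vec, dtype=float)

    # Normalize so that a dot product equals cosine similarity.
    train_unit = train / np.linalg.norm(train, axis=1, keepdims=True)
    query_unit = query / np.linalg.norm(query)
    sims = train_unit @ query_unit

    # Indices of the k most similar training examples.
    top_k = np.argsort(-sims)[:k]

    # Majority vote over the sense labels of those neighbors.
    votes = Counter(train_senses[i] for i in top_k)
    return votes.most_common(1)[0][0]


# Toy stand-ins for contextual embeddings of one ambiguous word
# (assumed data, not drawn from the Zuo Zhuan dataset).
train_vecs = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.9]]
train_senses = ["sense_A", "sense_A", "sense_B", "sense_B"]

predicted = knn_sense([0.8, 0.3], train_vecs, train_senses, k=3)
print(predicted)
```

Because the classifier only compares embeddings against labeled examples, it needs no task-specific training, which may be one reason a simple k-NN baseline suits a small, newly annotated dataset.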
Pages: 129-135
Page count: 7