Zuo Zhuan Ancient Chinese Dataset for Word Sense Disambiguation

Authors
Pan, Xiaomeng [1]
Wang, Hongfei [1]
Oka, Teruaki [1]
Komachi, Mamoru [1]
Affiliation
[1] Tokyo Metropolitan Univ, Tokyo, Japan
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Word Sense Disambiguation (WSD) is a core task in Natural Language Processing (NLP). Ancient Chinese has rarely been used in WSD tasks, however, because no public dataset for ancient Chinese WSD exists. Creating such a dataset is considered a significant challenge: word usage differs between ancient and modern Chinese, so determining the most appropriate sense in a given context is difficult and time-consuming. To address ancient Chinese WSD, we annotate part of the pre-Qin (before 221 BC) text Zuo Zhuan using a copyright-free dictionary to create a public sense-tagged dataset. We then apply a simple k-Nearest Neighbors (k-NN) method, using a pre-trained language model, to the dataset. Our code and dataset will be available on GitHub.
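The k-NN step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `cosine` and `knn_sense`, the choice of cosine similarity, the majority vote, and the two-dimensional toy vectors are all assumptions made for illustration. In practice, the vectors would be contextual embeddings of the target word produced by a pre-trained language model.

```python
import math
from collections import Counter

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def knn_sense(query, train_vecs, train_senses, k=3):
    """Return the majority sense among the k training vectors most similar to query."""
    # Rank training examples by similarity to the query, most similar first.
    ranked = sorted(
        range(len(train_vecs)),
        key=lambda i: cosine(query, train_vecs[i]),
        reverse=True,
    )
    # Majority vote over the sense labels of the k nearest neighbors.
    votes = Counter(train_senses[i] for i in ranked[:k])
    return votes.most_common(1)[0][0]

# Toy stand-ins for contextual embeddings of one ambiguous word (hypothetical data).
train_vecs = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.9]]
train_senses = ["sense_A", "sense_A", "sense_B", "sense_B"]

predicted = knn_sense([0.8, 0.3], train_vecs, train_senses, k=3)
print(predicted)
```

With sense-tagged examples such as those in the dataset, each annotated occurrence would contribute one vector and one sense label, and an unseen occurrence would be tagged with the sense most common among its nearest neighbors.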
Pages: 129-135
Page count: 7