A rule-extraction-based optimization method for feature selection in word sense disambiguation

被引:0
|
作者
Li, Hongbo [1 ]
Yu, Jianping [1 ]
Hong, Wenxue [2 ]
机构
[1] College of Foreign Studies, Yanshan University, No. 438, West of Hebei Avenue, Qinhuangdao,066004, China
[2] Institute of Electrical Engineering, Yanshan University, No. 438, West of Hebei Avenue, Qinhuangdao,066004, China
来源
ICIC Express Letters | 2016年 / 10卷 / 06期
基金
中国国家自然科学基金;
关键词
Classification (of information) - Natural language processing systems - Extraction - Learning algorithms - Learning systems - Semantics;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection is an important process in classification and pattern recognition and it has a direct influence on the accuracy of classifier. In this study, a new optimization method of feature selection by means of rule extraction is proposed for word sense disambiguation (WSD) of English modal verb “must”. A WSD model with all candidate features for “must” is constructed first with the approach of structural partialordered attribute diagram (SPOAD) and the accuracy of WSD is tested to be 94.5%. Then based on the WSD model and the rule-extraction algorithm, rules for the two senses of “must” are extracted, and accordingly the optimized feature set with only 6 attributes is obtained. The WSD model with the optimized feature set yields a classification accuracy of 97.5%, which is 3% higher than that of the original model. Therefore, it is concluded that the proposed method can optimize the feature set and is effective in dealing with binary classification problems in WSD. It can also be applied to other binary-classifier research and provides valuable reference for feature selection in machine learning and natural language processing. © 2016 ISSN.
引用
收藏
页码:1325 / 1333
相关论文
共 50 条
  • [41] Word sense disambiguation based on rough set
    陈清才
    王晓龙
    赵健
    陈滨
    王长风
    Journal of Harbin Institute of Technology, 2002, (02) : 201 - 204
  • [42] Word Sense Disambiguation based on Gloss Expansion
    Fard, M. Hazrati
    Fakhrahmad, S. M.
    Sadreddini, M. H.
    2014 6TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2014, : 7 - 10
  • [43] Word Sense Disambiguation Based on Semantic Knowledge
    Liang, Rui-Yan
    Luo, Chun-Yi
    Zhang, Chun-Xiang
    Lei, Tian-Yi
    Wang, Huan-Xi
    Li, Ming-Zhe
    PROCEEDINGS OF 2019 IEEE 2ND INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION AND COMMUNICATION TECHNOLOGY (ICEICT 2019), 2019, : 645 - 648
  • [44] Memory-based word sense disambiguation
    Veenstra, J
    van den Bosch, A
    Buchholz, S
    Daelemans, W
    Zavrel, J
    COMPUTERS AND THE HUMANITIES, 2000, 34 (1-2): : 171 - 177
  • [45] Word Sense Disambiguation based on Relation Structure
    Hwang, Myunggwon
    Choi, Chang
    Youn, Byungsu
    Kim, Pankoo
    ALPIT 2008: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 15 - +
  • [46] Similarity-based word sense disambiguation
    Karov, Y
    Edelman, S
    COMPUTATIONAL LINGUISTICS, 1998, 24 (01) : 41 - 59
  • [47] Memory-Based Word Sense Disambiguation
    Jorn Veenstra
    Antal van den Bosch
    Sabine Buchholz
    Walter Daelemans
    akub Zavrel
    Computers and the Humanities, 2000, 34 : 171 - 177
  • [48] Word Sense Disambiguation Based on Vicarious Words
    Lu, Zhimao
    Fan, DongMei
    Zhang, Rubo
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 6, PROCEEDINGS, 2008, : 101 - 105
  • [49] The Research of Chinese Name Entity Disambiguation Based On Word Sense Disambiguation
    Wang, Gang
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ELECTRONIC TECHNOLOGY, 2015, 6 : 412 - 416
  • [50] Unsupervised Word Sense Disambiguation based on Word Embedding and Collocation
    Han, Shangzhuang
    Shirai, Kiyoaki
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 1218 - 1225