A rule-extraction-based optimization method for feature selection in word sense disambiguation

被引:0
|
作者
Li, Hongbo [1 ]
Yu, Jianping [1 ]
Hong, Wenxue [2 ]
机构
[1] College of Foreign Studies, Yanshan University, No. 438, West of Hebei Avenue, Qinhuangdao,066004, China
[2] Institute of Electrical Engineering, Yanshan University, No. 438, West of Hebei Avenue, Qinhuangdao,066004, China
来源
ICIC Express Letters | 2016年 / 10卷 / 06期
基金
中国国家自然科学基金;
关键词
Classification (of information) - Natural language processing systems - Extraction - Learning algorithms - Learning systems - Semantics;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection is an important process in classification and pattern recognition and it has a direct influence on the accuracy of classifier. In this study, a new optimization method of feature selection by means of rule extraction is proposed for word sense disambiguation (WSD) of English modal verb “must”. A WSD model with all candidate features for “must” is constructed first with the approach of structural partialordered attribute diagram (SPOAD) and the accuracy of WSD is tested to be 94.5%. Then based on the WSD model and the rule-extraction algorithm, rules for the two senses of “must” are extracted, and accordingly the optimized feature set with only 6 attributes is obtained. The WSD model with the optimized feature set yields a classification accuracy of 97.5%, which is 3% higher than that of the original model. Therefore, it is concluded that the proposed method can optimize the feature set and is effective in dealing with binary classification problems in WSD. It can also be applied to other binary-classifier research and provides valuable reference for feature selection in machine learning and natural language processing. © 2016 ISSN.
引用
收藏
页码:1325 / 1333
相关论文
共 50 条
  • [31] Graph Based Word Sense Disambiguation
    Koppula, Neeraja
    Rani, B. Padmaja
    Rao, Koppula Srinivas
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS, ICCII 2016, 2017, 507 : 665 - 670
  • [32] WordNet Based Word Sense Disambiguation
    Sieminski, Andrzej
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT II: THIRD INTERNATIONAL CONFERENCE, ICCCI 2011, 2011, 6923 : 405 - 414
  • [33] Correlation Based Word Sense Disambiguation
    Agarwal, Madhavi
    Bajpai, Jyoti
    2014 SEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2014, : 382 - 386
  • [34] Use of word sense disambiguation in an information extraction system
    IBM T. J. Watson Research Cent, Hawthorne, United States
    Proc Natl Conf Artif Intell, (850-855):
  • [35] The use of word sense disambiguation in an information extraction system
    Chai, JY
    Biermann, AW
    SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), 1999, : 850 - 855
  • [36] Rules selection in word sense disambiguation using Adaboost
    Qin, Y
    Wang, XJ
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 26 - 29
  • [37] An Innovative Method for Hindi Word Sense Disambiguation
    Mishra B.K.
    Jain S.
    SN Computer Science, 4 (6)
  • [38] Bioinformatic Workflow Extraction from Scientific Texts based on Word Sense Disambiguation
    Halioui, Ahmed
    Valtchev, Petko
    Diallo, Abdoulaye Banire
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (06) : 1979 - 1990
  • [39] Deep Chinese Word Sense Disambiguation Method Based on Sequence to Sequence
    Tang Shancheng
    Ma Fuyu
    Chen Xiongxiong
    Zhang Puyue
    2018 INTERNATIONAL CONFERENCE ON SENSOR NETWORKS AND SIGNAL PROCESSING (SNSP 2018), 2018, : 498 - 503
  • [40] Research on the method of word sense disambiguation based on target language bigram
    Harbin Inst of Technology, Harbin, China
    Ruan Jian Xue Bao, 10 (21-25):