A rule-extraction-based optimization method for feature selection in word sense disambiguation

被引:0
|
作者
Li, Hongbo [1 ]
Yu, Jianping [1 ]
Hong, Wenxue [2 ]
机构
[1] College of Foreign Studies, Yanshan University, No. 438, West of Hebei Avenue, Qinhuangdao,066004, China
[2] Institute of Electrical Engineering, Yanshan University, No. 438, West of Hebei Avenue, Qinhuangdao,066004, China
来源
ICIC Express Letters | 2016年 / 10卷 / 06期
基金
中国国家自然科学基金;
关键词
Classification (of information) - Natural language processing systems - Extraction - Learning algorithms - Learning systems - Semantics;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection is an important process in classification and pattern recognition and it has a direct influence on the accuracy of classifier. In this study, a new optimization method of feature selection by means of rule extraction is proposed for word sense disambiguation (WSD) of English modal verb “must”. A WSD model with all candidate features for “must” is constructed first with the approach of structural partialordered attribute diagram (SPOAD) and the accuracy of WSD is tested to be 94.5%. Then based on the WSD model and the rule-extraction algorithm, rules for the two senses of “must” are extracted, and accordingly the optimized feature set with only 6 attributes is obtained. The WSD model with the optimized feature set yields a classification accuracy of 97.5%, which is 3% higher than that of the original model. Therefore, it is concluded that the proposed method can optimize the feature set and is effective in dealing with binary classification problems in WSD. It can also be applied to other binary-classifier research and provides valuable reference for feature selection in machine learning and natural language processing. © 2016 ISSN.
引用
收藏
页码:1325 / 1333
相关论文
共 50 条
  • [1] Japanese word sense disambiguation system based on deep feature extraction
    Lei, Xue-Mei
    Wang, Da-Liang
    Takaaki, Tanaka
    Zeng, Guang-Ping
    Beijing Keji Daxue Xuebao/Journal of University of Science and Technology Beijing, 2010, 32 (02): : 263 - 269
  • [2] Chinese word sense disambiguation based on maximum entropy model with feature selection
    He J.-Z.
    Wang H.-F.
    Ruan Jian Xue Bao/Journal of Software, 2010, 21 (06): : 1287 - 1295
  • [3] Best feature selection for maximum entropy-based word sense disambiguation
    Suárez, A
    Palomar, M
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2002, 2553 : 213 - 217
  • [4] Rule extraction for word sense disambiguation of english modal verb must
    Yu, J. (yjp@ysu.edu.cn), 1600, ICIC Express Letters Office, Tokai University, Kumamoto Campus, 9-1-1, Toroku, Kumamoto, 862-8652, Japan (07):
  • [5] Word Sense Disambiguation Based on Feature Ranking Graph
    Li, Yeqing
    Qiu, Xiaoyu
    2015 IEEE 29TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS WAINA 2015, 2015, : 209 - 212
  • [6] Feature Words Selection for Knowledge-based Word Sense Disambiguation with Syntactic Parsing
    Lu, Wenpeng
    Huang, Heyan
    Zhu, Chaoyong
    PRZEGLAD ELEKTROTECHNICZNY, 2012, 88 (1B): : 82 - 87
  • [7] Feature expansion for word sense disambiguation
    Tsao, NL
    Wible, D
    Kuo, CH
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 126 - 131
  • [8] Word Sense Disambiguation Studio: A Flexible System for WSD Feature Extraction
    Agre, Gennady
    Petrov, Daniel
    Keskinova, Simona
    INFORMATION, 2019, 10 (03)
  • [9] A Word Sense Disambiguation Method for Feature Level Sentiment Analysis
    Farooq, Umar
    Dhamala, Tej Prasad
    Nongaillard, Antoine
    Ouzrout, Yacine
    Qadir, Muhammad Abdul
    2015 9TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT AND APPLICATIONS (SKIMA), 2015,
  • [10] Word sense disambiguation by relative selection
    Seo, HC
    Rim, HC
    Jang, MG
    NATURAL LANGUAGE PROCESSING - IJCNLP 2005, PROCEEDINGS, 2005, 3651 : 920 - 932