Combining feature selection and classifier ensemble using a multiobjective simulated annealing approach: application to named entity recognition

Cited by: 17
Authors
Ekbal, Asif [1]
Saha, Sriparna [1]
Affiliations
[1] Indian Inst Technol, Dept Comp Sci & Engn, Patna, Bihar, India
Keywords
Natural language processing; Named entity recognition; Maximum entropy (ME); Conditional random field (CRF); Support vector machine (SVM); Multiobjective optimization (MOO); Simulated annealing (SA); Classifier ensemble; Weighted voting; ALGORITHM; WEB;
DOI
10.1007/s00500-012-0885-6
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose a two-stage technique for named entity recognition (NER) based on multiobjective simulated annealing (MOSA). First, MOSA is used for feature selection under two statistical classifiers, viz. conditional random field (CRF) and support vector machine (SVM). Each solution on the final Pareto optimal front provides a different classifier. These classifiers are then combined using a new MOSA-based classifier ensemble technique. Several different versions of the objective functions are exploited. We hypothesize that the reliability of each classifier's predictions differs among the output classes. Thus, in an ensemble system, it is necessary to find the appropriate vote weight for each output class in each classifier. We propose a MOSA-based technique to determine these vote weights automatically. The proposed two-stage technique is evaluated for NER in Bengali, a resource-poor language, as well as in English. The evaluation yields the highest recall, precision and F-measure values of 93.95%, 95.15% and 94.55%, respectively, for Bengali and 89.01%, 89.35% and 89.18%, respectively, for English. Experiments also suggest that the classifier ensemble identified by the proposed MOO-based approach, when optimizing the F-measure values of named entity (NE) boundary detection, outperforms all the individual classifiers and four conventional baseline models.
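The class-specific weighted voting described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `weighted_vote`, the label set, and the weight values are all hypothetical and hand-picked for illustration, whereas in the paper the weights are found by the MOSA search.

```python
from collections import defaultdict


def weighted_vote(predictions, weights):
    """Combine one predicted label per classifier into a single label.

    predictions: list of labels, predictions[m] is classifier m's output.
    weights: list of dicts, weights[m][label] is the vote weight that
             classifier m carries when it predicts `label`.
    """
    scores = defaultdict(float)
    for m, label in enumerate(predictions):
        # A classifier votes only for the class it predicted, with the
        # weight assigned to that (classifier, class) pair.
        scores[label] += weights[m].get(label, 0.0)
    # Return the label with the highest accumulated weight.
    return max(scores, key=scores.get)


# Hypothetical weights: classifier 0 is assumed reliable on PER,
# classifiers 1 and 2 on LOC. These values are illustrative only.
weights = [
    {"PER": 0.9, "LOC": 0.4, "O": 0.5},
    {"PER": 0.3, "LOC": 0.8, "O": 0.5},
    {"PER": 0.2, "LOC": 0.7, "O": 0.6},
]

agree_on_loc = weighted_vote(["PER", "LOC", "LOC"], weights)
three_way_split = weighted_vote(["PER", "O", "LOC"], weights)
```

In the second call every classifier predicts a different label, so plain majority voting would be tied; the per-class weights resolve the split in favour of the classifier assumed to be most reliable for the class it predicted.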
Pages: 1-16
Number of pages: 16
Related papers
50 in total
  • [31] Sentiment classification using hybrid feature selection and ensemble classifier
    Jain, Achin
    Jain, Vanita
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (02) : 659 - 668
  • [32] A Comparative Study of Named Entity Recognition for Arabic Using Ensemble Learning Approaches
    El bazi, Ismail
    Laachfoubi, Nabil
    2015 IEEE/ACS 12TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2015,
  • [33] Pattern recognition using evolutionary classifier and feature selection
    Nam, Mi Young
    Rhee, Phill Kyu
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4223 : 393 - 399
  • [34] Cybersecurity Named Entity Recognition Using Multi-Modal Ensemble Learning
    Yi, Feng
    Jiang, Bo
    Wang, Lu
    Wu, Jianjun
    IEEE ACCESS, 2020, 8 : 63214 - 63224
  • [35] Strip Hardness Prediction in Continuous Annealing Using Multiobjective Sparse Nonlinear Ensemble Learning With Evolutionary Feature Selection
    Wang, Xianpeng
    Wang, Yao
    Tang, Lixin
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (03) : 2397 - 2411
  • [36] An Ensemble Classifier Approach on Different Feature Selection Methods for Intrusion Detection
    Vinutha, H. P.
    Poornima, B.
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, INDIA 2017, 2018, 672 : 442 - 451
  • [37] An Experimental evaluation of Feature selection based Classifier Ensemble for Handwritten Numeral Recognition
    Singh, Pratibha
    Verma, Ajay
    Chaudhari, Narendra S.
    2014 INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2014,
  • [38] Amazighe Named Entity Recognition Using a A Rule Based Approach
    Boulaknadel, Siham
    Talha, Meryem
    Aboutajdine, Driss
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 478 - 484
  • [39] Named Entity Recognition in Crime Using Machine Learning Approach
    Shabat, Hafedh
    Omar, Nazlia
    Rahem, Khmael
    INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2014, 2014, 8870 : 280 - 288
  • [40] Named entity recognition using hybrid machine learning approach
    Chiong, Raymond
    Wei, Wang
    PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 578 - 583