Combining feature selection and classifier ensemble using a multiobjective simulated annealing approach: application to named entity recognition

被引:17
|
作者
Ekbal, Asif [1 ]
Saha, Sriparna [1 ]
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Patna, Bihar, India
关键词
Natural language processing; Named entity recognition; Maximum entropy (ME); Conditional random field (CRF); Support vector machine (SVM); Multiobjective optimization (MOO); Simulated annealing (SA); Classifier ensemble; Weighted voting; ALGORITHM; WEB;
D O I
10.1007/s00500-012-0885-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a two-stage multiobjective-simulated annealing (MOSA)-based technique for named entity recognition (NER). At first, MOSA is used for feature selection under two statistical classifiers, viz. conditional random field (CRF) and support vector machine (SVM). Each solution on the final Pareto optimal front provides a different classifier. These classifiers are then combined together by using a new classifier ensemble technique based on MOSA. Several different versions of the objective functions are exploited. We hypothesize that the reliability of prediction of each classifier differs among the various output classes. Thus, in an ensemble system, it is necessary to find out the appropriate weight of vote for each output class in each classifier. We propose a MOSA-based technique to determine the weights for votes automatically. The proposed two-stage technique is evaluated for NER in Bengali, a resource-poor language, as well as for English. Evaluation results yield the highest recall, precision and F-measure values of 93.95, 95.15 and 94.55 %, respectively for Bengali and 89.01, 89.35 and 89.18 %, respectively for English. Experiments also suggest that the classifier ensemble identified by the proposed MOO-based approach optimizing the F-measure values of named entity (NE) boundary detection outperforms all the individual classifiers and four conventional baseline models.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [11] Named entity recognition and classification in biomedical text using classifier ensemble
    Saha, Sriparna
    Ekbal, Asif
    Sikdar, Utpal Kumar
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 11 (04) : 365 - 391
  • [12] Hybrid Feature Selection Approach for Arabic Named Entity Recognition
    Shahine, Miran
    Sakre, Mohamed
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 452 - 464
  • [13] Finding Appropriate Subset of Votes Per Classifier Using Multiobjective Optimization: Application to Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    Hasanuzzaman, Md
    PROCEEDINGS OF THE 24TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2010, : 115 - 124
  • [14] Multiobjective Optimization Approach for Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    Garbe, Christoph S.
    PRICAI 2010: TRENDS IN ARTIFICIAL INTELLIGENCE, 2010, 6230 : 52 - +
  • [15] Classifier subset selection for biomedical named entity recognition
    Dimililer, Nazife
    Varoglu, Ekrem
    Altincay, Hakan
    APPLIED INTELLIGENCE, 2009, 31 (03) : 267 - 282
  • [16] Classifier subset selection for biomedical named entity recognition
    Nazife Dimililer
    Ekrem Varoğlu
    Hakan Altınçay
    Applied Intelligence, 2009, 31 : 267 - 282
  • [17] Feature Subset Selection Using Genetic Algorithm for Named Entity Recognition
    Hasanuzzaman, Md
    Saha, Sriparna
    Ekbal, Asif
    PROCEEDINGS OF THE 24TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2010, : 153 - 162
  • [18] MODE: multiobjective differential evolution for feature selection and classifier ensemble
    Sikdar, Utpal Kumar
    Ekbal, Asif
    Saha, Sriparna
    SOFT COMPUTING, 2015, 19 (12) : 3529 - 3549
  • [19] MODE: multiobjective differential evolution for feature selection and classifier ensemble
    Utpal Kumar Sikdar
    Asif Ekbal
    Sriparna Saha
    Soft Computing, 2015, 19 : 3529 - 3549
  • [20] Bengali Named Entity Recognition using Classifier Combination
    Ekbal, Asif
    Bandyopadhyay, Sivaji
    ICAPR 2009: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, PROCEEDINGS, 2009, : 259 - 262