Data-Driven Regular Expressions Evolution for Medical Text Classification Using Genetic Programming

被引:0
|
作者
Liu, Jiandong [1 ]
Bai, Ruibin [1 ]
Lu, Zheng [1 ]
Ge, Peiming [2 ]
Aickelin, Uwe [3 ]
Liu, Daoyun [2 ]
机构
[1] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
[2] Ping An Hlth Cloud Co Ltd China, Techonol Dept, Shanghai, Peoples R China
[3] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Vic, Australia
关键词
text classification; genetic programming; co-occurrence matrix; EXPERT-SYSTEM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In medical fields, text classification is one of the most important tasks that can significantly reduce human workload through structured information digitization and intelligent decision support. Despite the popularity of learning-based text classification techniques, it is hard for human to understand or manually fine-tune the classification for better precision and recall, due to the black box nature of learning. This study proposes a novel regular expression-based text classification method making use of genetic programming (GP) approaches to evolve regular expressions that can classify a given medical text inquiry with satisfaction. Given a seed population of regular expressions (randomly initialized or manually constructed by experts), our method evolves a population of regular expressions, using a novel regular expression syntax and a series of carefully chosen reproduction operators. Our method is evaluated with real-life medical text inquiries from an online healthcare provider and shows promising performance. More importantly, our method generates classifiers that can be fully understood, checked and updated by medical doctors, which are fundamentally crucial for medical related practices.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] A data-driven classification of feelings
    Thomson, David M. H.
    Crocker, Christopher
    FOOD QUALITY AND PREFERENCE, 2013, 27 (02) : 137 - 152
  • [22] Data-Driven Differential Dynamic Programming Using Gaussian Processes
    Pan, Yunpeng
    Theodorou, Evangelos A.
    2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 4467 - 4472
  • [23] Modelling Dynamic Facial Expressions of Emotion Using Data-Driven Methods
    Jack, Rachael Elizabeth
    PERCEPTION, 2019, 48 : 62 - 62
  • [24] Automatic Generation of Regular Expressions from Examples with Genetic Programming
    Bartoli, Alberto
    Davanzo, Giorgio
    De Lorenzo, Andrea
    Mauri, Marco
    Medvet, Eric
    Sorio, Enrico
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION COMPANION (GECCO'12), 2012, : 1477 - 1478
  • [25] Evolving text classification rules with genetic programming
    Hirsch, L
    Saeedi, M
    Hirsch, R
    APPLIED ARTIFICIAL INTELLIGENCE, 2005, 19 (07) : 659 - 676
  • [26] Adaptive Bi-objective Genetic Programming for Data-Driven System Modeling
    Bevilacqua, Vitoantonio
    Nuzzolese, Nicola
    Mininno, Ernesto
    Iacca, Giovanni
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 248 - 259
  • [27] Regular Expression Based Medical Text Classification Using Constructive Heuristic Approach
    Cui, Menglin
    Bai, Ruibin
    Lu, Zheng
    Li, Xiang
    Aickelin, Uwe
    Ge, Peiming
    IEEE ACCESS, 2019, 7 : 147892 - 147904
  • [28] Using data-driven feature enrichment of text representation and ensemble technique for sentence-level polarity classification
    Zhang, Pu
    He, Zhongshi
    JOURNAL OF INFORMATION SCIENCE, 2015, 41 (04) : 531 - 549
  • [29] Data-Driven Fault Classification Using Support Vector Machines
    Jallepalli, Deepthi
    Kakhki, Fatemeh Davoudi
    INTELLIGENT HUMAN SYSTEMS INTEGRATION 2021, 2021, 1322 : 316 - 322
  • [30] Automatic Classification of Data-Driven Respiratory Waveforms Using AI
    Walker, M. D.
    Su, K.
    Wollenweber, S. D.
    Johnsen, R.
    McGowan, D. R.
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2020, 47 (SUPPL 1) : S485 - S485