Data-Driven Regular Expressions Evolution for Medical Text Classification Using Genetic Programming

Authors
Liu, Jiandong [1]
Bai, Ruibin [1]
Lu, Zheng [1]
Ge, Peiming [2]
Aickelin, Uwe [3]
Liu, Daoyun [2]
Affiliations
[1] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
[2] Ping An Hlth Cloud Co Ltd China, Technol Dept, Shanghai, Peoples R China
[3] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Vic, Australia
Keywords
text classification; genetic programming; co-occurrence matrix; expert system
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In medical fields, text classification is one of the most important tasks, as it can significantly reduce human workload through structured information digitization and intelligent decision support. Despite the popularity of learning-based text classification techniques, it is hard for humans to understand or manually fine-tune such classifiers for better precision and recall, owing to the black-box nature of learning. This study proposes a novel regular expression-based text classification method that uses genetic programming (GP) to evolve regular expressions capable of classifying a given medical text inquiry satisfactorily. Given a seed population of regular expressions (randomly initialized or manually constructed by experts), our method evolves the population using a novel regular expression syntax and a series of carefully chosen reproduction operators. The method is evaluated on real-life medical text inquiries from an online healthcare provider and shows promising performance. More importantly, it generates classifiers that can be fully understood, checked, and updated by medical doctors, which is crucial for medical practice.
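The abstract does not spell out the paper's regular expression syntax or its reproduction operators, so the following is only a generic sketch of evolving regex classifiers from a seed population, not the authors' method. Everything in it is an assumption made for illustration: the toy inquiries in `DATA`, the keyword list `VOCAB`, the keyword-alternation encoding, the F1 fitness, and the simple toggle mutation and uniform crossover.

```python
import random
import re

# Hypothetical toy data: (inquiry text, is_positive) pairs; not from the paper.
DATA = [
    ("persistent cough and fever", True),
    ("fever with sore throat", True),
    ("dry cough at night", True),
    ("knee pain after running", False),
    ("skin rash on the arm", False),
    ("lower back pain", False),
]
# Hypothetical keyword vocabulary that individuals may draw from.
VOCAB = ["cough", "fever", "pain", "rash", "throat", "knee"]


def to_regex(terms):
    """Render an individual (a set of keywords) as an alternation regex."""
    return "|".join(re.escape(t) for t in sorted(terms))


def f1_score(terms, data):
    """F1 of the rule 'text matches the regex => positive class'."""
    if not terms:
        return 0.0
    pattern = re.compile(to_regex(terms))
    tp = fp = fn = 0
    for text, label in data:
        hit = pattern.search(text) is not None
        if hit and label:
            tp += 1
        elif hit:
            fp += 1
        elif label:
            fn += 1
    return 0.0 if tp == 0 else 2 * tp / (2 * tp + fp + fn)


def mutate(terms, rng):
    """Toggle one random vocabulary keyword in or out of the individual."""
    return frozenset(terms ^ {rng.choice(VOCAB)})


def crossover(a, b, rng):
    """Child keeps each keyword of either parent with probability 0.5."""
    return frozenset(t for t in sorted(a | b) if rng.random() < 0.5)


def evolve(seed_population, data, generations=20, seed=0):
    """Evolve keyword sets with elitism; return the fittest individual."""
    rng = random.Random(seed)
    population = [frozenset(p) for p in seed_population]
    size = len(population)
    for _ in range(generations):
        ranked = sorted(population, key=lambda p: (-f1_score(p, data), sorted(p)))
        elite = ranked[: max(1, size // 2)]  # the best half survives unchanged
        children = []
        while len(elite) + len(children) < size:
            a, b = rng.choice(elite), rng.choice(elite)
            children.append(mutate(crossover(a, b, rng), rng))
        population = elite + children
    return max(population, key=lambda p: f1_score(p, data))
```

Calling `evolve([{"pain"}, {"rash"}, {"cough"}, {"knee"}], DATA)` returns a keyword set whose pattern, rendered with `to_regex`, stays readable and can be edited by hand. Because the best half of each generation is carried over unchanged, the returned individual never scores below the best seed.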
Pages: 8