Learning Regular Expressions for Interpretable Medical Text Classification Using a Pool-based Simulated Annealing Approach

被引:0
|
作者
Tu, Chaofan [1 ]
Cui, Menglin [1 ]
机构
[1] Univ Nottingham, Sch Comp Sci, Ningbo, Peoples R China
关键词
simulated annealing; regular expression; medical text classification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a rule-based engine composed of high-quality and interpretable regular expressions for medical text classification. The regular expressions are auto-generated by a constructive heuristic method and optimized using a Pool-based Simulated Annealing (PSA) approach. Although existing Deep Neural Network (DNN) methods present high-quality performance in most Natural Language Processing (NLP) applications, the solutions are regarded as uninterpretable "black boxes" to humans. Therefore, rule-based methods are often introduced when interpretable solutions are needed, especially in the medical field. However, the construction of regular expressions can be extremely labor-intensive for large data sets. This research aims to reduce the manual efforts while maintaining high-quality solutions. The Pool-based Simulated Annealing method is proposed to automatically optimize the performance of machine-generated regular expressions without human interference. The proposed method is tested on real-life data provided by one of China's largest online medical platforms. Experimental results show that the proposed PSA method further improves the performance of initial machine-generated regular expressions compared with other meta-heuristics such as Genetic Programming. We also believe that the proposed method can serve as a vital complementary tool for the existing machine learning approaches in text classification applications when high levels of interpretability of the solutions are required.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] A Medical Case-Based Reasoning Approach Using Image Classification and Text Information for Recommendation
    Nasiri, Sara
    Zenkert, Johannes
    Fathi, Madjid
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT II, 2015, 9095 : 43 - 55
  • [22] A Hybrid RNN based Deep Learning Approach for Text Classification
    Sunagar, Pramod
    Kanavalli, Anita
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (06) : 289 - 295
  • [23] Interpretable Machine Learning for Personalized Medical Recommendations: A LIME-Based Approach
    Wu, Yuanyuan
    Zhang, Linfei
    Bhatti, Uzair Aslam
    Huang, Mengxing
    DIAGNOSTICS, 2023, 13 (16)
  • [24] Medical image segmentation based on simulated annealing and opposition-based learning island algorithm
    Jiming, M. A.
    Duan, HongYu
    Wang, YuFan
    Wang, LiNa
    PLOS ONE, 2024, 19 (07):
  • [25] Making sense of the black-boxes: Toward interpretable text classification using deep learning models
    Tao, Jie
    Zhou, Lina
    Hickey, Kevin
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2023, 74 (06) : 685 - 700
  • [26] Personality Classification from Online Text using Machine Learning Approach
    Khan, Alam Sher
    Ahmad, Hussain
    Asghar, Muhammad Zubair
    Saddozai, Furcian Khan
    Arir, Areeba
    Khalid, Hassan Ali
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (03) : 460 - 476
  • [27] Medical Text Classification Using Hybrid Deep Learning Models with Multihead Attention
    Prabhakar, Sunil Kumar
    Won, Dong-Ok
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [28] Distributed Text Classification With an Ensemble Kernel-Based Learning Approach
    Silva, Catarina
    Lotric, Uros
    Ribeiro, Bernardete
    Dobnikar, Andrej
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2010, 40 (03): : 287 - 297
  • [29] Medical Text Classification Based on an Optimized Machine Learning and External Semantic Resource
    Gasmi, Karim
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (16)
  • [30] Classification Methods of Text Documents Using Ontology Based Approach
    Lytvyn, Vasyl
    Vysotska, Victoria
    Veres, Oleh
    Rishnyak, Ihor
    Rishnyak, Halya
    ADVANCES IN INTELLIGENT SYSTEMS AND COMPUTING, CSIT 2016, 2017, 512 : 229 - 240