Ensemble based Cost-Sensitive Feature Selection for Consolidated Knowledge Base Creation

被引:3
|
作者
Ali, Syed Imran [1 ]
Lee, Sungyoung [1 ]
机构
[1] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin, Gyeonggi Do, South Korea
关键词
Data driven system; Feature Selection; Interpretable Machine Learning models; Ensemble Models; Decision Tree Models;
D O I
10.1109/imcom48794.2020.9001751
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This is paper proposes a knowledge construction system. The key objective of the system is to extract knowledge from structured data which is generally available in the form of electronic medical records (EMR). In this regard, the main focus of the research is to design and develop a domain-independent system that is capable of assisting the domain expert(s) in gaining non-trivial insights from the underlying EMR data. It is important to note that most of the research in the domain of cost-sensitive feature selection relies on black-box models which only provide a prediction of a final class label. Whereas, the goal of this research is to acquire insights for domain experts such as chronic kidney disease classification. This goal is achieved by designing and developing a knowledge construction system that is based on a two-stage methodology. Stage one deals with identifying salient cost-sensitive features in the EMR data, whereas, stage-two deals with consolidating knowledge (i.e. in the form of production rules) from a set of interpretable machine learning models. Finally, in order to demonstrate the efficacy of the system a chronic kidney disease case study is adopted.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Cost-sensitive feature selection based on Adaptive Hunting Optimization
    Liang, Yixuan
    2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE, CCAI 2024, 2024, : 546 - 551
  • [2] Cost-Sensitive Feature Selection on Heterogeneous Data
    Qian, Wenbin
    Shu, Wenhao
    Yang, Jun
    Wang, Yinglong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART II, 2015, 9078 : 397 - 408
  • [3] Cost-sensitive selection of variables by ensemble of model sequences
    Donghui Yan
    Zhiwei Qin
    Songxiang Gu
    Haiping Xu
    Ming Shao
    Knowledge and Information Systems, 2021, 63 : 1069 - 1092
  • [4] Cost-sensitive selection of variables by ensemble of model sequences
    Yan, Donghui
    Qin, Zhiwei
    Gu, Songxiang
    Xu, Haiping
    Shao, Ming
    KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (05) : 1069 - 1092
  • [5] Rough sets and Laplacian score based cost-sensitive feature selection
    Yu, Shenglong
    Zhao, Hong
    PLOS ONE, 2018, 13 (06):
  • [6] COST-SENSITIVE FEATURE SELECTION BASED ON LABEL SIGNIFICANCE AND POSITIVE REGION
    Huang, Jintao
    Qian, Wenbin
    Wu, Binglong
    Wang, Yinglong
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2019, : 403 - 409
  • [7] G-Forest: An ensemble method for cost-sensitive feature selection in gene expression microarrays
    Abdulla, Mai
    Khasawneh, Mohammad T.
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2020, 108 (108)
  • [8] Cost-Sensitive Ensemble Feature Ranking and Automatic Threshold Selection for Chronic Kidney Disease Diagnosis
    Imran Ali, Syed
    Ali, Bilal
    Hussain, Jamil
    Hussain, Musarrat
    Satti, Fahad Ahmed
    Park, Gwang Hoon
    Lee, Sungyoung
    APPLIED SCIENCES-BASEL, 2020, 10 (16):
  • [9] Cost-Sensitive Feature Selection for Class Imbalance Problem
    Bach, Malgorzata
    Werner, Aleksandra
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I, 2018, 655 : 182 - 194
  • [10] Cost-sensitive Feature Selection for Support Vector Machines
    Benitez-Pena, S.
    Blanquero, R.
    Carrizosa, E.
    Ramirez-Cobo, P.
    COMPUTERS & OPERATIONS RESEARCH, 2019, 106 : 169 - 178