Better Interpretable Models for Proteomics Data Analysis Using Rule-Based Mining

被引:1
|
作者
Jayrannejad, Fahrnaz [1 ]
Conrad, Tim O. F. [1 ,2 ]
机构
[1] Zuse Inst Berlin, Takustr 7, D-14195 Berlin, Germany
[2] Free Univ Berlin, Dept Math, Arnimallee 6, Berlin, Germany
关键词
Bioinformatics; Machine learning; Feature selection; Classification; Association rule mining; Jumping emerging pattern; Proteomics; Mass spectrometry; Clinical data; Biomarker; REGULARIZATION PATHS;
D O I
10.1007/978-3-319-69775-8_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in -omics technology has yielded in large data-sets in many areas of biology, such as mass spectrometry based proteomics. However, analyzing this data is still a challenging task mainly due to the very high dimensionality and high noise content of the data. One of the main objectives of the analysis is the identification of relevant patterns (or features) which can be used for classification of new samples to healthy or diseased. So, a method is required to find easily interpretable models from this data. To gain the above mentioned goal, we have adapted the disjunctive association rule mining algorithm, TitanicOR, to identify emerging patterns from our mass spectrometry proteomics data-sets. Comparison to five state-of-the-art methods shows that our method is advantageous them in terms of identifying the inter-dependency between the features and the TP-rate and precision of the features selected. We further demonstrate the applicability of our algorithm to one previously published clinical data-set.
引用
收藏
页码:67 / 88
页数:22
相关论文
共 50 条
  • [31] Efficient rule-based attribute-oriented induction for data mining
    Cheung, DW
    Hwang, HY
    Fu, AW
    Han, JW
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2000, 15 (02) : 175 - 200
  • [32] Efficient Rule-Based Attribute-Oriented Induction for Data Mining
    David W. Cheung
    H.Y. Hwang
    Ada W. Fu
    Jiawei Han
    Journal of Intelligent Information Systems, 2000, 15 : 175 - 200
  • [33] Facility layout using weighted association rule-based data mining algorithms: Evaluation with simulation
    Altuntas, Serkan
    Selim, Hasan
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) : 3 - 13
  • [34] Containment in Rule-Based Models
    Thompson-Walsh, C. D.
    Hayman, J.
    Winskel, G.
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2012, 284 : 125 - 137
  • [35] Parameter Synthesis and Robustness Analysis of Rule-Based Models
    Trojak, Matej
    Safranek, David
    Mertova, Lukrecia
    Brim, Lubos
    NASA FORMAL METHODS (NFM 2020), 2020, 12229 : 41 - 59
  • [36] Rule-based dependency models for security protocol analysis
    Chen, Qingfeng
    Zhang, Shichao
    Chen, Yi-Ping Phoebe
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2008, 15 (04) : 369 - 380
  • [37] Reinforced rule-based fuzzy models: Design and analysis
    Kim, Eun-Hu
    Oh, Sung-Kwun
    Pedrycz, Witold
    KNOWLEDGE-BASED SYSTEMS, 2017, 119 : 44 - 58
  • [38] Counterfactual Resimulation for Causal Analysis of Rule-Based Models
    Laurent, Jonathan
    Yang, Jean
    Fontana, Walter
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1882 - 1890
  • [39] Data undersampling models for the efficient rule-based retrosynthetic planning
    Park, Min Sik
    Lee, Dongseon
    Kwon, Youngchun
    Kim, Eunji
    Choi, Youn-Suk
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2021, 23 (46) : 26510 - 26518
  • [40] Design of Distributed Rule-Based Models in the Presence of Large Data
    E, Hanyu
    Cui, Ye
    Pedrycz, Witold
    Li, Zhiwu
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2023, 31 (07) : 2479 - 2486