Better Interpretable Models for Proteomics Data Analysis Using Rule-Based Mining

Cited by: 1
Authors
Jayrannejad, Fahrnaz [1]
Conrad, Tim O. F. [1, 2]
Affiliations
[1] Zuse Inst Berlin, Takustr 7, D-14195 Berlin, Germany
[2] Free Univ Berlin, Dept Math, Arnimallee 6, Berlin, Germany
Keywords
Bioinformatics; Machine learning; Feature selection; Classification; Association rule mining; Jumping emerging pattern; Proteomics; Mass spectrometry; Clinical data; Biomarker; REGULARIZATION PATHS;
DOI
10.1007/978-3-319-69775-8_4
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Recent advances in -omics technology have yielded large data-sets in many areas of biology, such as mass spectrometry-based proteomics. However, analyzing these data is still a challenging task, mainly because of their very high dimensionality and high noise content. One of the main objectives of the analysis is to identify relevant patterns (or features) that can be used to classify new samples as healthy or diseased. A method is therefore required that finds easily interpretable models in such data. To achieve this goal, we have adapted the disjunctive association rule mining algorithm TitanicOR to identify emerging patterns in our mass spectrometry proteomics data-sets. A comparison with five state-of-the-art methods shows that our method has an advantage over them in identifying the inter-dependency between features and in the TP-rate and precision of the selected features. We further demonstrate the applicability of our algorithm to one previously published clinical data-set.
Pages: 67-88
Number of pages: 22