A hybrid and exploratory approach to knowledge discovery in metabolomic data

被引:9
|
作者
Grissa, Dhouha [1 ,4 ]
Comte, Blandine [1 ]
Petera, Melanie [2 ]
Pujos-Guillot, Estelle [1 ]
Napoli, Amedeo [3 ]
机构
[1] Univ Clermont Auvergne, INRA, UNH, Mapping, F-63000 Clermont Ferrand, France
[2] Univ Clermont Auvergne, INRA, UNH, Plateforme Explorat Metab,MetaboHUB Clermont, F-63000 Clermont Ferrand, France
[3] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
[4] Univ Copenhagen, Novo Nordisk Fdn, Ctr Prot Res, Blegdamsvej 3B, DK-2200 Copenhagen, Denmark
关键词
Hybrid knowledge discovery; Pattern mining; Formal concept analysis; Data and pattern exploration; Metabolomic data; Classification; Visualization; Interpretation; FORMAL CONCEPT ANALYSIS; FEATURE-SELECTION;
D O I
10.1016/j.dam.2018.11.025
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we propose a hybrid and exploratory knowledge discovery approach for analyzing metabolomic complex data based on a combination of supervised classifiers, pattern mining and Formal Concept Analysis (FCA). The approach is based on three main operations, preprocessing, classification, and postprocessing. Classifiers are applied to datasets of the form individuals x features and produce sets of ranked features which are further analyzed. Pattern mining and FCA are used to provide a complementary analysis and support for visualization. A practical application of this framework is presented in the context of metabolomic data, where two interrelated problems are considered, discrimination and prediction of class membership. The dataset is characterized by a small set of individuals and a large set of features, in which predictive biomarkers of clinical outcomes should be identified. The problems of combining numerical and symbolic data mining methods, as well as discrimination and prediction, are detailed and discussed. Moreover, it appears that visualization based on FCA can be used both for guiding knowledge discovery and for interpretation by domain analysts. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:103 / 116
页数:14
相关论文
共 50 条
  • [21] Synchronized biological knowledge and data management: A hybrid approach
    Stephan, EG
    Chin, G
    Corrigan, AL
    Klicker, KR
    Sofia, HJ
    METMBS '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, 2004, : 105 - 110
  • [22] Hybrid Approach for Optimal Management and Querying of Data and Knowledge
    Hettiarachchi, Achini Sandeepani
    Goonatillake, Jeevani
    Wikramanayake, Gihan
    Walisadeera, Anusha
    2016 SIXTEENTH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER) - 2016, 2016, : 176 - 185
  • [23] Pattern discovery and exploratory data mining
    Wong, AKC
    PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2003, : 49 - 52
  • [24] Hybrid rough-genetic algorithm for knowledge discovery from large data
    Chakraborty, G
    Chakraborty, B
    SOFT COMPUTING AS TRANSDISCIPLINARY SCIENCE AND TECHNOLOGY, 2005, : 904 - 913
  • [25] Data Mining and Knowledge Discovery: An Approach for Sustaining Development in GCC Countries
    Al-Roubaie, Amer
    Abdul-Wahab, Rasha Shaker
    IACSIT-SC 2009: INTERNATIONAL ASSOCIATION OF COMPUTER SCIENCE AND INFORMATION TECHNOLOGY - SPRING CONFERENCE, 2009, : 240 - +
  • [26] Hypoplastic left heart syndrome: knowledge discovery with a data mining approach
    Kusiak, A
    Caldarone, CA
    Kelleher, MD
    Lamb, FS
    Persoon, TJ
    Burns, A
    COMPUTERS IN BIOLOGY AND MEDICINE, 2006, 36 (01) : 21 - 40
  • [27] Differential diagnosis of dementia: A Knowledge Discovery and Data Mining (KDD) approach
    Mani, S
    Shankle, WR
    Pazzani, MJ
    Smyth, P
    Dick, MB
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1997, : 875 - 875
  • [28] An ontology engineering approach for knowledge discovery from data in evolving domains
    Gottgtroy, P
    Kasabov, N
    Macdonell, S
    DATA MINING IV, 2004, 7 : 43 - 52
  • [29] A data mining approach to knowledge discovery from multidimensional cube structures
    Usman, Muhammad
    Pears, Russel
    Fong, A. C. M.
    KNOWLEDGE-BASED SYSTEMS, 2013, 40 : 36 - 49
  • [30] A Novel Approach of Data Sanitization by Noise Addition and Knowledge Discovery by Clustering
    Abdullah, Hadi
    Siddiqi, Ahsan
    Bajaber, Fuad
    2015 WORLD SYMPOSIUM ON COMPUTER NETWORKS AND INFORMATION SECURITY (WSCNIS), 2015,