A hybrid and exploratory approach to knowledge discovery in metabolomic data

被引:9
|
作者
Grissa, Dhouha [1 ,4 ]
Comte, Blandine [1 ]
Petera, Melanie [2 ]
Pujos-Guillot, Estelle [1 ]
Napoli, Amedeo [3 ]
机构
[1] Univ Clermont Auvergne, INRA, UNH, Mapping, F-63000 Clermont Ferrand, France
[2] Univ Clermont Auvergne, INRA, UNH, Plateforme Explorat Metab,MetaboHUB Clermont, F-63000 Clermont Ferrand, France
[3] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
[4] Univ Copenhagen, Novo Nordisk Fdn, Ctr Prot Res, Blegdamsvej 3B, DK-2200 Copenhagen, Denmark
关键词
Hybrid knowledge discovery; Pattern mining; Formal concept analysis; Data and pattern exploration; Metabolomic data; Classification; Visualization; Interpretation; FORMAL CONCEPT ANALYSIS; FEATURE-SELECTION;
D O I
10.1016/j.dam.2018.11.025
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we propose a hybrid and exploratory knowledge discovery approach for analyzing metabolomic complex data based on a combination of supervised classifiers, pattern mining and Formal Concept Analysis (FCA). The approach is based on three main operations, preprocessing, classification, and postprocessing. Classifiers are applied to datasets of the form individuals x features and produce sets of ranked features which are further analyzed. Pattern mining and FCA are used to provide a complementary analysis and support for visualization. A practical application of this framework is presented in the context of metabolomic data, where two interrelated problems are considered, discrimination and prediction of class membership. The dataset is characterized by a small set of individuals and a large set of features, in which predictive biomarkers of clinical outcomes should be identified. The problems of combining numerical and symbolic data mining methods, as well as discrimination and prediction, are detailed and discussed. Moreover, it appears that visualization based on FCA can be used both for guiding knowledge discovery and for interpretation by domain analysts. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:103 / 116
页数:14
相关论文
共 50 条
  • [41] Knowledge discovery in astronomical data
    Zhang, Yanxia
    Zheng, Hongwen
    Zhao, Yongheng
    ADVANCED SOFTWARE AND CONTROL FOR ASTRONOMY II, PTS 1 & 2, 2008, 7019
  • [42] KNOWLEDGE DISCOVERY IN ENVIRONMENTAL DATA
    Izquierdo, Joaquin
    Diaz, Jose L.
    Perez, Rafael
    Amparo Lopez, P.
    Mora, Jose J.
    INTEGRATED WATER MANAGEMENT: PRACTICAL EXPERIENCES AND CASE STUDIES, 2008, 80 : 51 - 68
  • [43] Knowledge discovery and data mining
    Lee, HY
    Lu, HJ
    Motoda, H
    KNOWLEDGE-BASED SYSTEMS, 1998, 10 (07) : 401 - 402
  • [44] Data warehousing and knowledge discovery
    Mohania, M
    Tjoa, AM
    Kambayashi, Y
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2000, 15 (01) : 5 - 6
  • [45] Big Data knowledge discovery
    Xhafa, Fatos
    Taniar, David
    KNOWLEDGE-BASED SYSTEMS, 2015, 79 : 1 - 2
  • [46] Knowledge discovery and data mining
    Brodley, CE
    Lane, T
    Stough, TM
    AMERICAN SCIENTIST, 1999, 87 (01) : 54 - 61
  • [47] Knowledge Discovery in Simulation Data
    Feldkamp, Niclas
    Bergmann, Soeren
    Strassburger, Steffen
    ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2020, 30 (04):
  • [48] Data mining and knowledge discovery
    Trybula, WJ
    ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 1997, 32 : 197 - 229
  • [49] Knowledge Discovery in Spatial Data
    Ye, Xinyue
    REGIONAL STUDIES, 2011, 45 (06) : 872 - 873
  • [50] Knowledge discovery from data?
    Pazzani, Michael J.
    IEEE Intelligent Systems and Their Applications, 2000, 15 (02): : 10 - 13