Data preparation for KDD through automatic reasoning based on description logic

Cited by: 11
Authors
Lara, Juan A. [1]
Lizcano, David [1]
Aurora Martinez, Ma [1]
Pazos, Juan [2]
Affiliations
[1] Univ Distancia Madrid, Fac Ensenanzas Tecn, Madrid 28400, Spain
[2] Univ Politecn Madrid, Sch Comp Sci, E-28660 Madrid, Spain
Keywords
KDD; Data preparation; Data mining; Description logic; Automatic reasoning; PROTOTYPE REDUCTION SCHEMES; TIME-SERIES
DOI
10.1016/j.is.2014.03.002
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Without data preparation, data mining algorithms cannot operate on data within the knowledge discovery in databases (KDD) process. In fact, the success of later KDD phases largely depends on the data preparation stage. The use of mechanisms for automatically preparing data saves considerable time and resources within the KDD process. These resources are then available for later, less automatable stages, for example, results interpretation. We propose a general-purpose mechanism, applicable to multiple domains, to improve the data preparation phase of the KDD process. This mechanism processes input data and automatically converts it to a format suitable for applying different data preparation techniques based on a known syntax. It is based on the use of description logic. Taking a generic UML2 data model as a reference, this mechanism is able to check whether any XML data source whatsoever can be transformed and modelled as a subsumption or instance of the above UML2 model. It thus automatically identifies a consistent, non-ambiguous and finite set of XSLT transformations which are used to prepare the data for the application of data mining techniques, obviating the need to expend resources on the preliminary preparation and formatting stage. The proposed mechanism was applied to structurally complex data from four different domains. To test the validity of the proposal, we applied data mining techniques to extract knowledge from the prepared data. The sound results of applying our proposal to several different domains confirm that it is applicable to any XML data source, as well as being correct, computationally efficient and saving time during the data preparation phase. (C) 2014 Elsevier Ltd. All rights reserved.
Pages: 54-72
Page count: 19