Data preparation for KDD through automatic reasoning based on description logic

被引:11
|
作者
Lara, Juan A. [1 ]
Lizcano, David [1 ]
Aurora Martinez, Ma [1 ]
Pazos, Juan [2 ]
机构
[1] Univ Distancia Madrid, Fac Ensenanzas Tecn, Madrid 28400, Spain
[2] Univ Politecn Madrid, Sch Comp Sci, E-28660 Madrid, Spain
关键词
KDD; Data preparation; Data mining; Description logic; Automatic reasoning; PROTOTYPE REDUCTION SCHEMES; TIME-SERIES;
D O I
10.1016/j.is.2014.03.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Without data preparation, data mining algorithms cannot operate on data within the knowledge discovery in databases (KDD) process. In fact, the success of later KDD phases largely depends on the data preparation stage. The use of mechanisms for automatically preparing data saves a lot of time and resources within the KDD process. These resources will then be available for use at later, less automatable stages, for example, during results interpretation. We have proposed a general-purpose mechanism applicable to multiple domains in order to improve the data preparation phase in the KDD process. This mechanism processes and automatically converts input data to a suitable format for the application of different data preparation techniques based on a known syntax. It is based on the use of description logic Taking a generic UML2 data model as a reference, this mechanism is able to check whether any XML data source whatsoever can be transformed and modelled as a subsumption or instance of the above UML2 model. Thus it automatically identifies a consistent, non-ambiguous and finite set of XLST transformations which are used to prepare the data for the application of data mining techniques, obviating the need to expend resources on the preliminary preparation and formatting stage. The proposed mechanism was applied on structurally complex data from four different domains. In order to test the validity of the proposal, we have applied data mining techniques to extract knowledge from the prepared data. The sound results of applying our proposal to several different domains confirm that it is applicable to any XML data source, as well as being correct, computationally efficient and saving time during the data preparation phase. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:54 / 72
页数:19
相关论文
共 50 条
  • [1] Visual Data Integration based on Description Logic Reasoning
    Caruccio, Loredana
    Deufemia, Vincenzo
    Polese, Giuseppe
    PROCEEDINGS OF THE 18TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM (IDEAS14), 2014, : 19 - 28
  • [2] Prolog Based Description Logic Reasoning
    Lukacsy, Gergely
    Szeredi, Peter
    Kadar, Balazs
    LOGIC PROGRAMMING, PROCEEDINGS, 2008, 5366 : 455 - 469
  • [3] Study of Semantic Reasoning based on Ontology Description Logic
    Wang, Jinhuan
    Li, Baomin
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 1869 - 1872
  • [4] Parallelizing tableaux-based description logic reasoning
    Liebig, Thorsten
    Mueller, Felix
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2007: OTM 2007 WORKSHOPS, PT 2, PROCEEDINGS, 2007, 4806 : 1135 - 1144
  • [5] Resolution based explanations for reasoning in the description logic ALC
    Deng, Xi
    Haarslev, Volker
    Shiri, Nematollaah
    CANADIAN SEMANTIC WEB, 2006, 2 : 189 - +
  • [6] A Diagnostics Framework based on Abductive Description Logic Reasoning
    Hubauer, Thomas M.
    Grimm, Stephan
    Lamparter, Steffen
    Roshchin, Mikhail
    2012 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2012, : 1046 - 1053
  • [7] A Description Logic for Analogical Reasoning
    Schockaert, Steven
    Ibanez-Garcia, Yazmin
    Gutierrez-Basulto, Victor
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2040 - 2046
  • [8] Description Logic reasoning in Prolog
    Lukacsy, Gergely
    LOGIC PROGRAMMING, PROCEEDINGS, 2006, 4079 : 463 - 464
  • [9] Description logic with default reasoning
    Dong, Ming-Kai
    Jiang, Yun-Cheng
    Shi, Zhong-Zhi
    Jisuanji Xuebao/Chinese Journal of Computers, 2003, 26 (06): : 729 - 736
  • [10] Fuzzy Reasoning in Description Logic
    Gasmi, Mohamed
    Bourahla, Mustapha
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2016, 16 (07): : 71 - 82