Data preparation for KDD through automatic reasoning based on description logic

被引:11
|
作者
Lara, Juan A. [1 ]
Lizcano, David [1 ]
Aurora Martinez, Ma [1 ]
Pazos, Juan [2 ]
机构
[1] Univ Distancia Madrid, Fac Ensenanzas Tecn, Madrid 28400, Spain
[2] Univ Politecn Madrid, Sch Comp Sci, E-28660 Madrid, Spain
关键词
KDD; Data preparation; Data mining; Description logic; Automatic reasoning; PROTOTYPE REDUCTION SCHEMES; TIME-SERIES;
D O I
10.1016/j.is.2014.03.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Without data preparation, data mining algorithms cannot operate on data within the knowledge discovery in databases (KDD) process. In fact, the success of later KDD phases largely depends on the data preparation stage. The use of mechanisms for automatically preparing data saves a lot of time and resources within the KDD process. These resources will then be available for use at later, less automatable stages, for example, during results interpretation. We have proposed a general-purpose mechanism applicable to multiple domains in order to improve the data preparation phase in the KDD process. This mechanism processes and automatically converts input data to a suitable format for the application of different data preparation techniques based on a known syntax. It is based on the use of description logic Taking a generic UML2 data model as a reference, this mechanism is able to check whether any XML data source whatsoever can be transformed and modelled as a subsumption or instance of the above UML2 model. Thus it automatically identifies a consistent, non-ambiguous and finite set of XLST transformations which are used to prepare the data for the application of data mining techniques, obviating the need to expend resources on the preliminary preparation and formatting stage. The proposed mechanism was applied on structurally complex data from four different domains. In order to test the validity of the proposal, we have applied data mining techniques to extract knowledge from the prepared data. The sound results of applying our proposal to several different domains confirm that it is applicable to any XML data source, as well as being correct, computationally efficient and saving time during the data preparation phase. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:54 / 72
页数:19
相关论文
共 50 条
  • [11] Concept learning in the description logic ALCH(D) based onminimal model reasoning for RDF data
    1600, Japanese Society for Artificial Intelligence (29):
  • [12] Semantics and reasoning of description logic μALCIO
    Jiang, Yun-Cheng
    Wang, Ju
    Deng, Pei-Min
    Tang, Yong
    Zhou, Sheng-Ming
    Jisuanji Xuebao/Chinese Journal of Computers, 2009, 32 (07): : 1280 - 1290
  • [13] Dynamic Reasoning for Description Logic Terminologies
    Ustymenko, Stanislav
    Schwartz, Daniel G.
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2010, 6085 : 340 - +
  • [14] Reasoning with individuals for the description logic SHIQ
    Horrocks, I
    Sattler, U
    Tobies, S
    AUTOMATED DEDUCTION - CADE-17, 2000, 1831 : 482 - 496
  • [15] Individual reuse in description logic reasoning
    Motik, Boris
    Horrocks, Ian
    AUTOMATED REASONING, PROCEEDINGS, 2008, 5195 : 242 - 258
  • [16] Description Logic reasoning with syntactic updates
    Halashek-Wiener, Christian
    Parsia, Bijan
    Sirin, Evren
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2006: COOPIS, DOA, GADA, AND ODBAS, PT 1, PROCEEDINGS, 2006, 4275 : 722 - 737
  • [17] Ordering Heuristics for Description Logic Reasoning
    Tsarkov, Dmitry
    Horrocks, Ian
    19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 609 - 614
  • [18] Reasoning on XBRL metadata in description logic
    Wang, Dong
    Pan, Ding
    Zhang, Yingmin
    Information Technology Journal, 2013, 12 (24) : 8000 - 8004
  • [19] Concept constructing in the description logic SROIQ based on minimal RDF reasoning
    Kanciwa K.
    Nagai T.
    Transactions of the Japanese Society for Artificial Intelligence, 2020, 35 (01)
  • [20] A Defeasible Reasoning Approach for Description Logic Ontologies
    Moodley, Kody
    Meyer, Thomas
    Varzinczak, Ivan Jose
    PROCEEDINGS OF THE SOUTH AFRICAN INSTITUTE FOR COMPUTER SCIENTISTS AND INFORMATION TECHNOLOGISTS CONFERENCE, 2012, : 69 - 78